IFN-ALPHA HOMOLOGUES



COPYRIGHT NOTIFICATION

Pursuant to 37 C.F.R. 1.71(e), a portion of this patent document contains
material which is subject to copyright protection. The copyright owner has no objection to
the facsimile reproduction by anyone of the patent document or the patent disclosure, as it
appears in the Patent and Trademark Office patent file or records, but otherwise reserves
all copyright rights whatsoever.

CROSS REFERENCE TO RELATED APPLICATIONS 

This application is a continuation-in-part application of and claims the
benefit of and priority to U.S. Patent Application Serial No. 09/145,483, filed October 7,
1999, the disclosure of which is incorporated herein by reference in its entirety for all
purposes.

FIELD OF THE INVENTION 
The present invention relates to the generation of new interferon-alpha
homologues.

BACKGROUND OF THE INVENTION 

Interferon-alphas are members of the diverse helical-bundle superfamily of
cytokine genes (Sprang, S.R. et al. (1993) Curr. Opin. Struct. Biol. 3:815-827). The
human interferon-alphas are encoded by a family of over 20 tandemly duplicated
nonallelic genes that share 85-98% sequence identity at the amino acid level (Henco, K. et
al. (1985) J. Mol. Biol. 185:227-260).

Interferon-alphas have been shown to inhibit various types of cellular
proliferation, and are especially useful for the treatment of a variety of cellular
proliferation disorders frequently associated with cancer, particularly hematologic
malignancies such as leukemias. These proteins have shown antiproliferative activity
against multiple myeloma, chronic lymphocytic leukemia, low-grade lymphoma, Kaposi's
sarcoma, chronic myelogenous leukemia, renal-cell carcinoma, urinary bladder tumors and
ovarian cancers (Bonnem, E.M. et al. (1984) J. Biol. Response Modifiers 3:580; Oldham,
R.K. (1985) Hospital Practice 20:71).

Interferon-alphas are also useful against various types of viral infections
(Pinter, N.B. et al. (1991) Drugs 42(5):749). Interferon-alphas have shown activity
against human papillomavirus infection, Hepatitis B, and Hepatitis C infections (Pinter,
N.B. et al., 1991, supra; Kashima, H. et al. (1988) Laryngoscope 98:334; Dusheiko, G.M.
et al. (1986) J. Hematology 3 (Suppl. 2):S199; Davis, G.L. et al. (1989) N. England J.
Med. 321:1501). The role of interferons and interferon receptors in the pathogenesis of
certain autoimmune and inflammatory diseases has also been investigated (Benoit, P. et al.
(1993) J. Immunol. 150(3):707).

Although these proteins possess therapeutic value in the treatment of a
number of diseases, they have not been optimized for use as pharmaceuticals. For
example, dose-limiting toxicity, receptor cross-reactivity, and short serum half-lives
significantly reduce the clinical utility of many of these cytokines (Dusheiko, G. (1997)
Hepatology 26:112S-121S; Vial, T. and Descotes, J. (1994) Drug Experience 10:115-
150; Funke, I. et al. (1994) Ann. Hematol. 68:49-52; Schomburg, A. et al. (1993) J.
Cancer Res. Clin. Oncol. 119:745-755). Diverse and severe side effect profiles which
accompany interferon administration include flu-like symptoms, fatigue, neurological
disorders including hallucination, fever, hepatic enzyme elevation, and leukopenia
(Pontzer, C.H. et al. (1991) Cancer Res. 51:5304; Oldham, 1985, supra).

The existence of abundant naturally occurring sequence diversity within the
interferon-alphas (and hence a large sequence space of recombinants) along with the
intricacy of interferon-alpha/receptor interactions and variety of therapeutic and
prophylactic activities creates an opportunity for the construction of superior interferon
homologues.

SUMMARY OF THE INVENTION 

The invention provides novel interferon-alpha (IFN-alpha or IFN-α)
homologue polypeptides, nucleic acids encoding the polypeptides and complementary
nucleotide sequences thereof, fragments of said polypeptides and nucleic acids, antibodies
to the polypeptides, and uses therefor, data sets containing character strings of interferon-
alpha homologue sequences, and automated systems for using the character strings.

In one aspect, the invention includes an isolated or recombinant interferon-
alpha nucleic acid homologue. Included are polynucleotide sequences selected from
SEQ ID NO:1 to SEQ ID NO:35, or from SEQ ID NO:72 to SEQ ID NO:78, and
complementary polynucleotide sequences thereof. Polynucleotide sequences encoding a
polypeptide selected from SEQ ID NO:36 to SEQ ID NO:70 or from SEQ ID NO:79 to
SEQ ID NO:85, and complementary polynucleotide sequences thereof, are also a feature of
the invention. Similarly, a polynucleotide sequence which hybridizes under highly
stringent conditions over substantially the entire length of any of the preceding
polynucleotide sequences is a feature of the present invention. In addition, a
polynucleotide sequence comprising a nucleotide fragment of any of the preceding
polynucleotide sequences, which nucleotide fragment encodes a polypeptide having an
antiproliferative activity in a human Daudi cell line-based cell proliferation assay, is a
feature of the invention. Similarly, a polynucleotide sequence comprising a nucleotide
fragment of any of the polynucleotide sequences of the invention described above and
below which encodes a polypeptide having antiviral activity in a murine cell line/EMCV-
based assay is a feature of the invention.

The invention also includes an isolated or recombinant nucleic acid,
comprising a polynucleotide sequence encoding a polypeptide, wherein the polypeptide
comprises the amino acid sequence: CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-
X22-R-X24-S-X26-FSCLKDR-X34-DFG-X38-P-X40-EEFD-X45-X46-X47-FQ-X50-X51-QAI-
X55-X56-X57-HE-X60-X61-QQTFN-X67-FSTK-X72-SS-X75-X76-W-X78-X79-X80-L^^
X85-X86-T-X88-L-X90-QQLN-X95-LEACV-X101-Q-X103-V-X105-X106-X107-X108-TPL^
X114-D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-
WEVVRAEIMRSFSFSTNLQKRLRRKE, or a conservatively substituted variation
thereof, where X11 is N or D; X12 is R, S, or K; X15 is L or M; X16 is I, M, or V; X19 is A or
G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is H, Y, or Q; X38 is F or L; X40 is Q or R;
X45 is G or S; X46 is N or H; X47 is Q or R; X50 is K or R; X51 is A or T; X55 is S or F; X56
is V or A; X57 is L or F; X60 is M or I; X61 is I or M; X67 is L or F; X72 is D or N; X75 is A
or V; X76 is A or T; X78 is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is E or D; X85 is
F or L; X86 is S or Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M, or V;
X103 is E or G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or G; X114 is V,
E, or G; X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K or R; and
X140 is A or S. Each of the single letters of this amino acid sequence represents a
particular amino acid residue according to standard practice known to those of ordinary
skill in the art.

A polypeptide having any of the preceding sequences, such as those
embodied in SEQ ID NO:36 to SEQ ID NO:54, is also a feature of the invention.
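
As a minimal illustration of how the variable positions in the amino acid sequence above constrain a candidate polypeptide, the N-terminal portion of that sequence (positions 1 to 44, CDLPQTHSLG through EEFD) can be written as a regular expression. This is only a sketch under stated assumptions: the names `N_TERMINAL_MOTIF` and `matches_n_terminal_motif` are hypothetical, only the first 44 positions are encoded, and conservatively substituted variations are not modeled.

```python
import re

# Positions 1-44 of the sequence given above. Each bracketed class lists the
# residues permitted at one variable position (X11, X12, X15, ...).
# The constant and function names are illustrative assumptions.
N_TERMINAL_MOTIF = re.compile(
    "CDLPQTHSLG"  # positions 1-10 (fixed)
    "[ND]"        # X11
    "[RSK]"       # X12
    "RA"          # positions 13-14
    "[LM]"        # X15
    "[IMV]"       # X16
    "LL"          # positions 17-18
    "[AG]"        # X19
    "QM"          # positions 20-21
    "[GR]"        # X22
    "R"           # position 23
    "[IT]"        # X24
    "S"           # position 25
    "[PH]"        # X26
    "FSCLKDR"     # positions 27-33
    "[HYQ]"       # X34
    "DFG"         # positions 35-37
    "[FL]"        # X38
    "P"           # position 39
    "[QR]"        # X40
    "EEFD"        # positions 41-44
)


def matches_n_terminal_motif(protein: str) -> bool:
    """Return True if the first 44 residues satisfy the encoded positions."""
    return N_TERMINAL_MOTIF.match(protein.upper()) is not None
```

A full-length check would extend the pattern through position 166 in the same way, one character class per variable position.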

In other embodiments, the encoded polypeptide comprises an amino acid
sequence selected from the group consisting of SEQ ID NO:36 to SEQ ID NO:54; and the
nucleic acid comprises a polynucleotide sequence selected from the group consisting of
SEQ ID NO:1 to SEQ ID NO:19.

The invention also provides polypeptide fragments of any of SEQ ID NOS:36-
70 and SEQ ID NOS:72-79. In one aspect of the invention, such a polypeptide fragment
exhibits an antiproliferative activity in a human Daudi cell line-based cell proliferation
assay or an antiviral activity in a murine cell line/EMCV-based assay, or both said
activities. The human Daudi cell line-based cell proliferation assay and antiviral activity
in a murine cell line/EMCV-based assay are described in greater detail below. In yet
another aspect, the invention provides a polynucleotide sequence comprising a nucleotide
fragment of any nucleic acid of the invention described above and below, wherein said
nucleotide fragment encodes a polypeptide fragment that exhibits an antiproliferative
activity in a human Daudi cell line-based cell proliferation assay or an antiviral activity in
a murine cell line/EMCV-based assay, or both activities, as is described in greater detail
below.

The invention also includes an isolated or recombinant nucleic acid
comprising a polynucleotide sequence encoding a polypeptide, wherein the polypeptide
comprises an amino acid sequence comprising at least 20 contiguous amino acids of any
one of SEQ ID NOS:36-70. In other embodiments, the polypeptide of the invention
comprises an amino acid sequence comprising one or more of amino acid residues (Tyr or
Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and
Glu166, wherein the numbering of the amino acid residues corresponds to the numbering
of residues in the amino acid sequence of SEQ ID NO:36. In various embodiments, the
encoded polypeptide of the invention comprises at least 30, at least 50, at least 70, at least
75, at least 100, at least 110, at least 120, at least 130, at least 140, at least 150, at least
155, at least 160, or at least 165 contiguous amino acid residues of any one of SEQ ID
NOS:36-70. In other embodiments, the encoded polypeptide is at least 150, at least 155,
at least 160, at least 163, or at least 165 amino acids in length. In another embodiment, the
encoded polypeptide is about 166 amino acids in length. In yet other embodiments, the
encoded polypeptide comprises an amino acid sequence selected from SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ
ID NO:45, and SEQ ID NO:46.
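
The "at least N contiguous amino acids" language above can be pictured as a longest-shared-run comparison between a candidate sequence and a reference sequence. The following is a hedged sketch only: the function names `longest_shared_run` and `has_contiguous_fragment` are hypothetical, the comparison is exact (no conservative substitutions), and the reference string would be supplied by the reader from the sequence listing.

```python
def longest_shared_run(candidate: str, reference: str) -> int:
    """Length of the longest contiguous stretch present in both sequences."""
    best = 0
    # prev[j] holds the length of the shared run ending at the previous
    # candidate residue and reference[j - 1].
    prev = [0] * (len(reference) + 1)
    for residue in candidate:
        curr = [0] * (len(reference) + 1)
        for j, ref_residue in enumerate(reference, start=1):
            if residue == ref_residue:
                curr[j] = prev[j - 1] + 1
                best = max(best, curr[j])
        prev = curr
    return best


def has_contiguous_fragment(candidate: str, reference: str, min_length: int = 20) -> bool:
    """True if the candidate shares at least min_length contiguous residues."""
    return longest_shared_run(candidate, reference) >= min_length
```

The default of 20 mirrors the smallest threshold recited above; the larger thresholds (30, 50, 70, and so on) are the same test with a different `min_length`.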

In other embodiments, the invention provides a nucleic acid that comprises
a polynucleotide sequence selected from SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3,
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:10, and SEQ ID
NO:11.

In other embodiments, the polypeptide encoded by any nucleic acid of the
invention described herein or a fragment thereof may have antiproliferative activity in a
human Daudi cell line-based assay, or antiviral activity in a human WISH cell/EMCV-
based assay. In other embodiments, the encoded polypeptide has antiproliferative activity
of at least about 8.3x10^ units/milligram in the human Daudi cell line-based assay (1 unit
is the amount of protein in milligrams (mg) required to induce 50% antiproliferative
activity), or antiviral activity of at least about 2.1x10^ units/milligram (mg) in the human
WISH cell/EMCV-based assay (1 unit is the amount of protein in mg required to induce
50% antiviral activity). In other embodiments, the encoded polypeptide can bind to a type
I interferon receptor, preferably a human type I interferon receptor, more preferably a
human (e.g., type I) interferon-alpha receptor.
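
Because one unit is defined above as the amount of protein (in mg) that produces a 50% effect, a specific activity in units/mg is simply the reciprocal of that amount. The sketch below makes this arithmetic explicit; the function names are hypothetical, and the threshold is left as a parameter rather than a fixed constant.

```python
def specific_activity_units_per_mg(mg_for_half_maximal_effect: float) -> float:
    """Convert the protein amount (mg) giving a 50% effect into units/mg.

    One unit is the amount of protein in mg required for a 50% effect, so a
    preparation needing x mg per unit contains 1 / x units per mg.
    """
    if mg_for_half_maximal_effect <= 0:
        raise ValueError("protein amount must be positive")
    return 1.0 / mg_for_half_maximal_effect


def meets_minimum_activity(mg_for_half_maximal_effect: float,
                           minimum_units_per_mg: float) -> bool:
    """True if the specific activity reaches the caller-supplied minimum."""
    return specific_activity_units_per_mg(mg_for_half_maximal_effect) >= minimum_units_per_mg
```

Under this definition, a smaller amount of protein per unit corresponds to a more potent preparation.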

The invention also includes a cell comprising any nucleic acid of the
invention described herein, or which expresses any polypeptide of the invention noted
herein. In one embodiment, the cell expresses a polypeptide encoded by the nucleic acid
of the invention as described herein.

The invention also includes a vector comprising any nucleic acid of the
invention described above and below. The vector can comprise a plasmid, a cosmid, a
phage, or a virus; the vector can be, e.g., an expression vector, a cloning vector, a
packaging vector, an integration vector, or the like. The invention also includes a cell
transduced by a vector of the invention. The invention also includes compositions
comprising any nucleic acid of the invention described above and below, and an excipient,
preferably a pharmaceutically acceptable excipient. Cells and transgenic animals which
include any polypeptide or nucleic acid of the invention described above and below, e.g.,
produced by transduction of the vector, are a feature of the invention.

The invention also includes compositions produced by digesting one or
more of the nucleic acids of the invention described above or below with a restriction
endonuclease, an RNAse, or a DNAse; and compositions produced by incubating one or
more nucleic acids described above or below in the presence of deoxyribonucleotide
triphosphates and a nucleic acid polymerase, e.g., a thermostable polymerase.

The invention also includes compositions comprising two or more nucleic 
acids described above or below. The composition may comprise a library of nucleic acids, 
where the library contains at least about 5, 10, 20 or 50 nucleic acids. 

In another aspect, the invention includes an isolated or recombinant
polypeptide encoded by any nucleic acid described above or below. In one embodiment,
the polypeptide may comprise a sequence selected from SEQ ID NO:36 to SEQ ID
NO:70, or SEQ ID NO:79 to SEQ ID NO:85.

The invention also includes a polypeptide comprising at least 50 contiguous
amino acids of a protein encoded by a polynucleotide sequence, the polynucleotide
sequence selected from the group consisting of: (a) SEQ ID NO:1 to SEQ ID NO:35 or
SEQ ID NO:72 to SEQ ID NO:78; (b) a polynucleotide sequence that encodes a
polypeptide selected from SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID
NO:85; and (c) a complementary sequence of a polynucleotide sequence which hybridizes
under highly stringent conditions over substantially the entire length of polynucleotide
sequence (a) or (b). In various embodiments, the polypeptide comprises at least about 70,
100, 120, 130, 140, 150, 155, 160, 165, or 166 contiguous amino acids of the encoded
protein.

The invention also includes an isolated or recombinant polypeptide
comprising an amino acid sequence comprising at least 50 contiguous amino acid residues
of any one of SEQ ID NOS:36-70, and one or more of amino acids Ala19, (Tyr or Gln)34,
Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and Glu166, where
the numbering of the amino acids corresponds to that of SEQ ID NO:36. In various
embodiments, the polypeptide comprises at least about 50, 70, 75, 100, 110, 120, 130, 140,
150, 155, 160, 163, 165, or 166 contiguous amino acids of any one of SEQ ID NOS:36-70.
In more preferred embodiments, the polypeptide comprises at least about 50, 70, 75, 100,
110, 120, 130, 140, 150, 155, 160, 163, 165, or 166 contiguous amino acid residues of any
one of SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,

WO 01/25438

PCT/US00/27781



SEQ ID NO:42, SEQ ID NO:45, or SEQ ID NO:46. In other embodiments, the
polypeptide of the invention is at least about 50, 70, 75, 100, 110, 120, 130, 140, 150, 155,
160, 163, 165, or 166 amino acid residues in length, or is preferably 166 amino acids in
length. Longer polypeptides, e.g., which comprise purification tags or the like, are also
contemplated. Such polypeptides may display antiproliferative activities in a human Daudi
cell line-based assay and/or antiviral activities in a human WISH cell/EMCV-based assay.
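
The residue list above (Ala19, (Tyr or Gln)34, Gly37, and so on, numbered as in SEQ ID NO:36) can be checked position by position once a candidate sequence is expressed in that numbering. The sketch below is illustrative only: `PREFERRED_RESIDUES` and `matching_positions` are hypothetical names, and the code assumes the candidate is already aligned to SEQ ID NO:36 without gaps, so that string index `pos - 1` corresponds to residue `pos`.

```python
# One-letter codes for the residues listed above, keyed by 1-based position
# in the numbering of SEQ ID NO:36. Names here are illustrative assumptions.
PREFERRED_RESIDUES = {
    19: "A",    # Ala19
    34: "YQ",   # Tyr or Gln at 34
    37: "G",    # Gly37
    38: "F",    # Phe38
    71: "K",    # Lys71
    76: "A",    # Ala76
    90: "Y",    # Tyr90
    132: "I",   # Ile132
    134: "R",   # Arg134
    152: "F",   # Phe152
    160: "K",   # Lys160
    166: "E",   # Glu166
}


def matching_positions(protein: str) -> list:
    """Return the 1-based positions at which a listed residue is present."""
    hits = []
    for position, allowed in PREFERRED_RESIDUES.items():
        if position <= len(protein) and protein[position - 1] in allowed:
            hits.append(position)
    return hits
```

A sequence satisfies the "one or more of" language when the returned list is non-empty.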

The invention also includes a polypeptide which specifically binds
polyclonal antisera raised against at least one antigen, said at least one antigen comprising
a polypeptide sequence selected from an amino acid sequence set forth in SEQ ID NO:36
to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85 or a fragment thereof. In
particular, the invention provides polypeptides which bind polyclonal antisera raised
against at least one antigen, wherein said at least one antigen comprises at least one amino
acid sequence set forth in SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID
NO:85, or a fragment of any of these amino acid sequences, wherein the polyclonal antisera is
subtracted with one or more known interferon-alpha polypeptides or proteins, including,
e.g., a polypeptide or protein encoded by a nucleic acid having or corresponding to one
or more of the following GenBank™ accession numbers: J00210 (alpha-D), J00207
(alpha-A), X02958 (alpha-6), X02956 (alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-IB), X03125 (alpha-S), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), 101614, 101787, 107821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
I08313 (alpha-Con1), and other similar or homologous interferon-alpha nucleic acid
sequences presented in GenBank.

Any polypeptide described above or below optionally has antiproliferative
activity in a human Daudi cell line-based assay and/or antiviral activity in a human
WISH cell/EMCV-based assay. Any polypeptide described above or below can have
antiproliferative activity of at least about 8.3x10^ units/mg in the human Daudi cell line-
based assay or antiviral activity of at least about 2.1x10^ units/mg in the human WISH
cell/EMCV-based assay. In other embodiments, any polypeptide described above or
below can bind to a type I interferon receptor, preferably a human type I interferon
receptor, more preferably a human interferon-alpha receptor.

In other embodiments, any polypeptide described above or below may
further include a secretion/localization sequence, e.g., a signal sequence, an organelle
targeting sequence, a membrane localization sequence, and the like. Any polypeptide
described herein may further include a sequence that facilitates purification, e.g., an
epitope tag (such as a FLAG epitope), a polyhistidine tag, a GST fusion, and the like.
The polypeptide optionally includes a methionine at the N-terminus. Any polypeptide of
the invention described herein optionally includes one or more modified amino acids,
such as a glycosylated amino acid, a PEG-ylated amino acid, a farnesylated amino acid, an
acetylated amino acid, a biotinylated amino acid, a carboxylated amino acid, a
phosphorylated amino acid, an acylated amino acid, or the like.

The invention also includes compositions comprising any polypeptide
described herein in an excipient, preferably a pharmaceutically acceptable excipient.

The invention also includes an antibody or antisera produced by
administering one or more of the polypeptides of the invention described herein to a
mammal, wherein the antibody or antisera does not specifically bind to a known alpha-
interferon polypeptide or protein, including, e.g., any polypeptide or protein encoded by a
nucleic acid having or corresponding to one or more of the following GenBank accession
numbers: J00210 (alpha-D), J00207 (alpha-A), X02958 (alpha-6), X02956 (alpha-5),
V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-IB), X03125 (alpha-S), X02957
(alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7),
X02961 (alpha-10 pseudogene), R0067 (Gx-1), 101614, 101787, 107821, M12350 (alpha-
F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1), and other similar or
homologous interferon-alpha sequences presented in GenBank.

The invention also includes antibodies which specifically bind a
polypeptide comprising a sequence selected from SEQ ID NO:36 to SEQ ID NO:70 or
SEQ ID NO:79 to SEQ ID NO:85. The antibodies are, e.g., polyclonal, monoclonal,
chimeric, humanized, single chain, Fab fragments, fragments produced by an Fab
expression library, or the like.

Methods for producing the polypeptides of the invention are also included.
One such method comprises introducing into a population of cells any nucleic acid
described herein, operatively linked to a regulatory sequence effective to produce the
encoded polypeptide, culturing the cells in a culture medium to produce the polypeptide,
and optionally isolating the polypeptide from the cells or from the culture medium. The
nucleic acid may be part of a vector, such as a recombinant expression vector.

The invention also includes a method of inhibiting growth of tumor cells,
by contacting the tumor cells with a polypeptide of the invention described herein,
thereby inhibiting growth of the tumor cells. In one embodiment, the invention includes a
method of inhibiting growth of a population of tumor cells comprising contacting the
population of tumor cells with an effective amount of a polypeptide of the invention
sufficient to inhibit growth of tumor cells in said population of tumor cells, thereby
inhibiting growth of tumor cells in said population of cells. In various embodiments, the
tumor cells can be human carcinoma cells, human leukemia cells, human T-lymphoma
cells, human melanoma cells, other human cancer cells as described herein, and the like.
The tumor cells can be in vivo, ex vivo, or in vitro (e.g., cultured cells).

The invention also includes a method of inhibiting the replication of a virus
within one or more cells infected by the virus, by contacting one or more of the infected
cells with an effective amount of a polypeptide of the invention as described above and
below, wherein said amount is sufficient to inhibit viral replication in said one or more
infected cells, thereby inhibiting replication of the virus in the one or more cells. In
various embodiments, the virus can be an RNA virus, e.g., a human immunodeficiency
virus or a hepatitis C virus, or a DNA virus, e.g., a hepatitis B virus. The infected cells
can be in vivo, ex vivo, or in vitro (e.g., cultured cells).

The invention also includes a method of treating an autoimmune disorder
in a subject in need of such treatment, by administering to the subject an effective amount
of a polypeptide of the invention as described herein sufficient to treat the autoimmune
disorder. In various embodiments, the autoimmune disorder may be multiple sclerosis,
rheumatoid arthritis, lupus erythematosus, type I diabetes, and the like. The invention
also includes, in a method of treating a disorder treatable by administration of interferon-
alpha to a subject, an improvement comprising administering to the subject an effective
amount of a polypeptide of the invention as described herein sufficient to treat said
disorder. The disorder treatable by administration of interferon-alpha may be
multiple sclerosis, rheumatoid arthritis, lupus erythematosus, type I diabetes, AIDS or
AIDS-related complexes, or the like.

In general, nucleic acids and proteins derived by mutation of the sequences
herein are a feature of the invention. Similarly, those produced by diversity generation or
recursive sequence recombination (RSR) methods (e.g., DNA shuffling) are a feature of
the invention. Mutation and recombination methods using the nucleic acids described
herein are a feature of the invention. For example, one method of the invention includes
recursively recombining one or more nucleic acid sequences of the invention as described
above and below with one or more additional nucleic acids (including, but not limited to,
those noted herein), each sequence of the one or more additional nucleic acids encoding an
interferon-alpha homologue or an amino acid subsequence thereof. The recombining steps
are optionally performed in vivo, ex vivo, in silico or in vitro. Said recursive
recombination produces at least one library of recombinant interferon-alpha homologue
nucleic acids. Also included in the invention are a recombinant interferon-alpha
homologue nucleic acid produced by this method, a cell containing the recombinant
interferon-alpha homologue nucleic acid, a nucleic acid library produced by this recursive
recombination method, a composition comprising two or more of said recombinant
interferon-alpha nucleic acids, and a population of cells comprising such recombinant
interferon-alpha nucleic acids or containing the library. In one embodiment, the library
comprises at least ten such recombinant nucleic acids.
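
For the in silico case mentioned above, recursive recombination can be pictured as repeatedly building chimeras from a pool of aligned parental character strings and feeding each round's output back in as the next round's parents. The following is a toy sketch, not the method of the invention: the function names, the fixed number of crossover points, the equal-length requirement, and the absence of any screening or selection step between rounds are all simplifying assumptions.

```python
import random


def recombine(parents, crossovers=2, rng=None):
    """Build one chimera by switching between aligned parents at random points."""
    rng = rng or random.Random()
    length = len(parents[0])
    if any(len(parent) != length for parent in parents):
        raise ValueError("parents must be pre-aligned to equal length")
    # Choose distinct interior crossover points, then copy each segment
    # from a randomly chosen parent.
    points = sorted(rng.sample(range(1, length), crossovers))
    bounds = [0] + points + [length]
    pieces = []
    for start, end in zip(bounds, bounds[1:]):
        pieces.append(rng.choice(parents)[start:end])
    return "".join(pieces)


def build_library(parents, size=10, rounds=2, rng=None):
    """Recursively recombine: each round's chimeras become the next parents."""
    rng = rng or random.Random()
    pool = list(parents)
    for _ in range(rounds):
        pool = [recombine(pool, rng=rng) for _ in range(size)]
    return pool
```

With `size=10`, the returned pool corresponds to the embodiment in which the library comprises at least ten recombinant sequences, although some members of such a toy library may be identical.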

The invention also provides a method of producing a modified or
recombinant interferon-alpha homologue nucleic acid that comprises mutating a nucleic
acid of the invention as described herein.

Also provided are nucleic acids that encode an interferon-alpha homologue
having an increased growth inhibition activity, cytostatic activity, or cytotoxic activity
against a population of cells (e.g., cancer cells) relative to the growth inhibition activity,
cytostatic activity, or cytotoxic activity, respectively, of human interferon-alpha 2a or
other known interferon-alpha against the population of cells.

These and other objects and features of the invention will become more 
fully apparent when the following detailed description is read in conjunction with the 
accompanying figures.

BRIEF DESCRIPTION OF THE FIGURES

Figures 1A-1E show an alignment of exemplary mature interferon
homologue polypeptide sequences (SEQ ID NOS:36-70 and 79-85) according to the
invention.

Figure 2 shows antiproliferative activities in a human Daudi cell line-
based assay and antiviral activities in a human WISH cell/EMCV-based assay of,
respectively, exemplary interferon homologues of the present invention relative to the
respective antiproliferative and antiviral activities of two control compounds, human
interferon alpha-2a ("IFN-α-2a" or "2a") and consensus human interferon ("IFN-Con1" or
"Con1").

Figures 3A, 3B, and 3C illustrate activity profiles of IFN-alpha homologue
3DA11 (SEQ ID NO:40) and control interferons, human interferon alpha-2a ("2a") and
consensus human interferon alpha ("Con1"), against a panel of tumor cell lines. Fig. 3A
shows the cell growth inhibitory activity of IFN-alpha homologue 3DA11 and each
control IFN on each respective cell line as reflected in the GI50 value, which is the
concentration (ng/ml) of interferon alpha homologue or control IFN alpha at which growth
of a particular cell line is inhibited by 50%, as measured by a 50% reduction in the net
protein/polypeptide increase in cells treated with the interferon alpha homologue or
control IFN alpha at the end of the incubation period.

Fig. 3B shows the cytostatic activity of IFN-alpha homologue 3DA11 and
each control IFN on each cell line of the panel of cell lines. Cytostatic activity refers to an
activity capable of suppressing growth and multiplication of cells. Cytostatic activity is
assessed as a reflection of the concentration of IFN-alpha homologue 3DA11 or control
IFN (µg/ml) at which the growth and/or multiplication of cells of a particular cell line is
completely inhibited or suppressed, such that the amount of cellular protein at the end of
the incubation period equals the amount of cellular protein at the beginning of the
incubation period ("total growth inhibition" or "TGI").

Fig. 3C illustrates the cytotoxic activity of IFN-alpha homologue 3DA11
and each control IFN on each respective cell line. The cytotoxicity of an agent (e.g., an
IFN homologue or IFN compound) is the degree to which the agent possesses a specific
destructive action on certain cells or the possession of such action. The term typically
refers to an agent capable of causing cell death and is used particularly in referring to the
lysis of cells by immune phenomena and to agents or compounds that selectively kill
dividing cells. In Fig. 3C, cytotoxic activity is illustrated as LC50, the concentration of
IFN-alpha homologue 3DA11 (ng/ml) at which a 50% reduction in the net protein increase
in control cells (control IFN alpha) at the end of the incubation as compared to that at the
beginning of the incubation period is observed, indicating a net loss of cells following
addition of the particular interferon. Cytotoxic activity may be assessed as the
concentration of IFN-alpha homologue 3DA11 at which, relative to the control cells, 50%
of the total number of cells (i.e., total population) of a particular cell line are destroyed or
killed.
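
The three endpoints described for Figs. 3A to 3C (GI50, TGI, and LC50) can all be read off a single "percent growth" scale computed from cellular protein at the start of incubation, in untreated control cells at the end, and in treated cells at the end. The sketch below is one way to express that, offered as an assumption rather than as the assay protocol: the function names are hypothetical, protein amounts are taken as already measured, and the concentration is found by simple linear interpolation between tested doses.

```python
def percent_growth(protein_start, protein_control_end, protein_treated_end):
    """Percent growth of treated cells on a +100 (control) to -100 (all lost) scale.

    +50 corresponds to GI50 (net protein increase halved), 0 to TGI (protein at
    end equals protein at start), and -50 to LC50 (half the starting protein lost).
    """
    if protein_start <= 0 or protein_control_end <= protein_start:
        raise ValueError("control cells must show net growth from a positive start")
    change = protein_treated_end - protein_start
    if change >= 0:
        # Net growth: express relative to the growth of the untreated control.
        return 100.0 * change / (protein_control_end - protein_start)
    # Net loss: express relative to the protein present at the start.
    return 100.0 * change / protein_start


def concentration_at(target, concentrations, growth_values):
    """Linearly interpolate the concentration at which growth crosses target."""
    points = list(zip(concentrations, growth_values))
    for (c0, g0), (c1, g1) in zip(points, points[1:]):
        if g0 != g1 and (g0 - target) * (g1 - target) <= 0:
            return c0 + (target - g0) * (c1 - c0) / (g1 - g0)
    return None  # target not bracketed by the tested concentrations
```

Calling `concentration_at` with targets of 50, 0, and -50 on the same dose series gives estimates of GI50, TGI, and LC50 respectively, in whatever concentration units the series uses.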

Figs. 4A, 4B, 4C, and 4D show the cytostatic activity of selected
interferon-alpha homologues of the present invention relative to the cytostatic activities of
two control interferon alphas, human interferon-alpha 2a ("2a") and consensus human
interferon-alpha ("Con1"), against a leukemia cell line (RPMI-8226) (Fig. 4A), a lung
cancer cell line (NCI-H23) (Fig. 4B), a renal cancer cell line (ACHN) (Fig. 4C), and an
ovarian cancer cell line (OVCAR-3) (Fig. 4D), respectively. Cytostatic activity is
reflected by a TGI value for a particular interferon alpha (i.e., the concentration of
interferon alpha at which cell growth of a cell line is totally inhibited, wherein the amount
of cellular protein at the end of the incubation period equals the amount of cellular protein
at the beginning of the incubation period).

Fig. 5 presents a comparison of the number of mice (out of a total number
of six mice) that survived following administration of doses of 2 µg, 10 µg, and 50 µg of
two exemplary IFN-alpha homologues of the present invention (designated "IFN-CH2.2"
and "IFN-CH2.3"), doses of 2 µg, 10 µg, and 50 µg of murine IFN-alpha-4, and doses of 2
µg, 10 µg, and 50 µg of human IFN-alpha-2a, respectively. The results shown in Fig. 5
demonstrate that in a murine model system, the improved in vitro antiviral activity of these
two exemplary IFN-alpha homologues is maintained and sustained in vivo. Phosphate-
buffered saline (PBS) is used as a control.

DETAILED DESCRIPTION OF THE INVENTION 

DEFINITIONS 

Unless otherwise defined herein or below in the remainder of the
specification, all technical and scientific terms used herein have the same meaning as
commonly understood by those of ordinary skill in the art to which the present invention
belongs.

A "polynucleotide sequence" is a nucleic acid (which is a polymer of
nucleotides (A, C, T, U, G, etc. or naturally occurring or artificial nucleotide analogues)) or a
character string representing a nucleic acid, depending on context. Either the given
nucleic acid or the complementary nucleic acid can be determined from any specified
polynucleotide sequence.
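
When a polynucleotide sequence is handled as a character string, determining the complementary nucleic acid amounts to reversing the string and pairing each base. A minimal sketch follows; the function name `reverse_complement` is hypothetical, and only the unmodified bases A, C, G, T, and U are handled (nucleotide analogues and ambiguity codes are outside its scope).

```python
DNA_PAIRS = {"A": "T", "C": "G", "G": "C", "T": "A"}
RNA_PAIRS = {"A": "U", "C": "G", "G": "C", "U": "A"}


def reverse_complement(sequence: str, rna: bool = False) -> str:
    """Return the complementary strand, written 5' to 3'."""
    pairs = RNA_PAIRS if rna else DNA_PAIRS
    try:
        # Reverse first so the result reads 5' to 3', then pair each base.
        return "".join(pairs[base] for base in reversed(sequence.upper()))
    except KeyError as err:
        raise ValueError(f"unsupported character: {err.args[0]!r}") from None
```

Applying the function twice returns the original string (in upper case), which is a quick sanity check on any sequence listing entry.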

Similarly, an "amino acid sequence" is a polymer of amino acids (a protein,
polypeptide, etc.) or a character string representing an amino acid polymer, depending on
context. Either the given nucleic acid or the complementary nucleic acid can be
determined from any specified polynucleotide sequence.

A nucleic acid, protein, peptide, polypeptide, or other component is
"isolated" when it is partially or completely separated from components with which it is
normally associated (other peptides, polypeptides, proteins (including complexes, e.g.,
polymerases and ribosomes which may accompany a native sequence), nucleic acids,
cells, synthetic reagents, cellular contaminants, cellular components, etc.), e.g., such as
from other components with which it is normally associated in the cell from which it was
originally derived. A nucleic acid, polypeptide, or other component is isolated when it is
partially or completely recovered or separated from other components of its natural
environment such that it is the predominant species present in a composition, mixture, or
collection of components (i.e., on a molar basis it is more abundant than any other
individual species in the composition). In preferred embodiments, the preparation consists
of more than 70%, typically more than 80%, or preferably more than 90% of the isolated
species.

In one aspect, a "substantially pure" or "isolated" nucleic acid (e.g., RNA or
DNA), polypeptide, protein, or composition also means where the object species (e.g.,
nucleic acid or polypeptide) comprises at least about 50, 60, or 70 percent by weight (on a
molar basis) of all macromolecular species present. A substantially pure or isolated
composition can also comprise at least about 80, 90, or 95 percent by weight of all
macromolecular species present in the composition. An isolated object species can also be
purified to essential homogeneity (contaminant species cannot be detected in the
composition by conventional detection methods) wherein the composition consists
essentially of derivatives of a single macromolecular species.

The term "isolated nucleic acid" may refer to a nucleic acid (e.g., DNA or
RNA) that is not immediately contiguous with both of the coding sequences with which it
is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the naturally
occurring genome of the organism from which the nucleic acid of the invention is derived.
Thus, this term includes, e.g., a cDNA or a genomic DNA fragment produced by
polymerase chain reaction (PCR) or restriction endonuclease treatment, whether such
cDNA or genomic DNA fragment is incorporated into a vector, integrated into the genome
of the same or a different species than the organism, including, e.g., a virus, from which it
was originally derived, linked to an additional coding sequence to form a hybrid gene
encoding a chimeric polypeptide, or independent of any other DNA sequences. The DNA
may be double-stranded or single-stranded, sense or antisense.

A nucleic acid or polypeptide is "recombinant" when it is artificial or
engineered, or derived from an artificial or engineered protein or nucleic acid. The term
"recombinant" when used with reference, e.g., to a cell, nucleotide, vector, or polypeptide
typically indicates that the cell, nucleotide, or vector has been modified by the introduction
of a heterologous (or foreign) nucleic acid or the alteration of a native nucleic acid, or that
the polypeptide has been modified by the introduction of a heterologous amino acid, or
that the cell is derived from a cell so modified. Recombinant cells express nucleic acid
sequences (e.g., genes) that are not found in the native (non-recombinant) form of the cell
or express native nucleic acid sequences (e.g., genes) that would be abnormally expressed,
under-expressed, or not expressed at all. The term "recombinant nucleic acid" (e.g., DNA
or RNA) molecule means, for example, a nucleotide sequence that is not naturally
occurring or is made by the combination (for example, artificial combination) of at least two
segments of sequence that are not typically included together, not typically associated with
one another, or are otherwise typically separated from one another. A recombinant nucleic
acid can comprise a nucleic acid molecule formed by the joining together or combination
of nucleic acid segments from different sources and/or artificially synthesized. The term
"recombinantly produced" refers to an artificial combination usually accomplished by
either chemical synthesis means, recursive sequence recombination of nucleic acid
segments or other diversity generation methods (such as, e.g., shuffling) of nucleotides, or
manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques
known to those of ordinary skill in the art. "Recombinantly expressed" typically refers to
techniques for the production of a recombinant nucleic acid in vitro and transfer of the
recombinant nucleic acid into cells in vivo, in vitro, or ex vivo where it may be expressed
or propagated. A "recombinant polypeptide" or "recombinant protein" usually refers to a
polypeptide or protein, respectively, that results from a cloned or recombinant gene or
nucleic acid.

A "subsequence" or "fragment" is any portion of an entire sequence, up to
and including the complete sequence. 

Numbering of a given amino acid or nucleotide polymer "corresponds to
numbering" of a selected amino acid polymer or nucleic acid when the position of any
given polymer component (amino acid residue, incorporated nucleotide, etc.) is designated
by reference to the same residue position in the selected amino acid or nucleotide, rather
than by the actual position of the component in the given polymer.
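By way of non-limiting illustration, numbering that "corresponds to numbering" of a selected polymer can be sketched as follows, assuming the given and selected (reference) sequences have already been aligned to equal length with "-" marking gaps. The function name `reference_numbering` and the example strings are hypothetical.

```python
from typing import List, Optional, Tuple


def reference_numbering(
    aligned_given: str, aligned_ref: str, gap: str = "-"
) -> List[Tuple[int, Optional[int], str]]:
    """Designate each residue of the given polymer by its reference position.

    Both strings must come from the same pairwise alignment (equal length).
    Returns (own_position, reference_position, residue) tuples, 1-based.
    The reference position is None where the reference has a gap.
    """
    if len(aligned_given) != len(aligned_ref):
        raise ValueError("aligned sequences must have equal length")
    own_pos = 0
    ref_pos = 0
    mapping = []
    for given_char, ref_char in zip(aligned_given, aligned_ref):
        if ref_char != gap:
            ref_pos += 1
        if given_char != gap:
            own_pos += 1
            mapping.append((own_pos, ref_pos if ref_char != gap else None, given_char))
    return mapping


if __name__ == "__main__":
    # The given polymer lacks the residue aligned to reference position 3,
    # so its third residue (P) is designated position 4.
    for row in reference_numbering("CD-PQ", "CDLPQ"):
        print(row)
```

In this sketch, a residue's designated position follows the selected polymer, so deletions or insertions in the given polymer do not shift the numbering of downstream residues.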

A vector is a composition for facilitating cell transduction by a selected
nucleic acid, or expression of the nucleic acid in the cell. Vectors include, e.g., plasmids,
cosmids, viruses, YACs, bacteria, poly-lysine, etc. An "expression vector" is a nucleic
acid construct, generated recombinantly or synthetically, with a series of specific nucleic
acid elements that permit transcription of a particular nucleic acid in a host cell. The
expression vector can be part of a plasmid, virus, or nucleic acid fragment. The
expression vector typically includes a nucleic acid to be transcribed operably linked to a
promoter.

"Substantially an entire length of a polynucleotide or amino acid sequence" 
refers to at least about 50%, at least about 60%, generally at least about 70%, generally at 
least about 80%, or typically at least about 90%, 95%, 96%, 97%, 98%, or 99% or more
of a length of an amino acid sequence or nucleic acid sequence. 

"A human alpha-interferon receptor" is a receptor which is naturally
activated in human cells by an alpha interferon.

"Naturally occurring" as applied to an object refers to the fact that the 
object can be found in nature. For example, a polypeptide or polynucleotide sequence that
is present in an organism, including viruses, that can be isolated from a source in nature
and which has not been intentionally modified by man in the laboratory is naturally
occurring. In one aspect, a "naturally occurring" nucleic acid (e.g., DNA or RNA)
molecule is a nucleic acid molecule that exists in the same state as it exists in nature; that
is, the nucleic acid molecule is not isolated, recombinant, or cloned.

As used herein, an "antibody" refers to a protein comprising one or more
polypeptides substantially or partially encoded by immunoglobulin genes or fragments of
immunoglobulin genes. The recognized immunoglobulin genes include the kappa,
lambda, alpha, gamma, delta, epsilon and mu constant region genes, as well as myriad
immunoglobulin variable region genes. Light chains are classified as either kappa or
lambda. Heavy chains are classified as gamma, mu, alpha, delta, or epsilon, which in turn
define the immunoglobulin classes, IgG, IgM, IgA, IgD and IgE, respectively. A typical
immunoglobulin (e.g., antibody) structural unit comprises a tetramer. Each tetramer is
composed of two identical pairs of polypeptide chains, each pair having one "light" (about
25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus of each chain defines a
variable region of about 100 to 110 or more amino acids primarily responsible for antigen
recognition. The terms variable light chain (VL) and variable heavy chain (VH) refer to
these light and heavy chains, respectively. Antibodies exist as intact immunoglobulins or
as a number of well characterized fragments produced by digestion with various
peptidases. Thus, for example, pepsin digests an antibody below the disulfide linkages in
the hinge region to produce F(ab)'2, a dimer of Fab which itself is a light chain joined to
VH-CH1 by a disulfide bond. The F(ab)'2 may be reduced under mild conditions to break
the disulfide linkage in the hinge region thereby converting the (Fab')2 dimer into an Fab'
monomer. The Fab' monomer is essentially an Fab with part of the hinge region (see
Fundamental Immunology, W.E. Paul, ed., Raven Press, N.Y. (1993), for a more detailed
description of other antibody fragments). While various antibody fragments are defined in
terms of the digestion of an intact antibody, one of skill will appreciate that such Fab'
fragments may be synthesized de novo either chemically or by utilizing recombinant DNA
methodology. Thus, the term antibody, as used herein also includes antibody fragments
either produced by the modification of whole antibodies or synthesized de novo using
recombinant DNA methodologies. Antibodies include single chain antibodies, including
single chain Fv (sFv) antibodies in which a variable heavy and a variable light chain are
joined together (directly or through a peptide linker) to form a continuous polypeptide.

An "antigen-binding fragment" of an antibody is a peptide or polypeptide
fragment of the antibody which binds an antigen. An antigen-binding site is formed by
those amino acids of the antibody which contribute to, are involved in, or affect the
binding of the antigen. See Scott, T.A. and Mercer, E.I., Concise Encyclopedia:
Biochemistry and Molecular Biology (de Gruyter, 3d ed. 1997) [hereinafter "Scott,
Concise Encyclopedia"] and Watson, J.D. et al., Recombinant DNA (2d ed. 1992)
[hereinafter "Watson, Recombinant DNA"], each of which is incorporated herein by
reference in its entirety for all purposes.

An "immunogen" refers to a substance that is capable of provoking an
immune response. Examples of immunogens include, e.g., antigens, autoantigens that
play a role in induction of autoimmune diseases, and tumor-associated antigens expressed
on cancer cells.

An "antigen" is a substance that is capable of eliciting the formation of 
antibodies in a host or generating a specific population of lymphocytes reactive with that 
substance. Antigens are typically macromolecules (e.g., proteins and polysaccharides) 
that are foreign to the host. 

The term "immunoassay" includes an assay that uses an antibody or
immunogen to bind or specifically bind an antigen. The immunoassay is typically
characterized by the use of specific binding properties of a particular antibody to isolate,
target, and/or quantify the antigen.

The term "homology" generally refers to the degree of similarity between
two or more structures. The term "homologous sequences" refers to regions in
macromolecules that have a similar order of monomers. When used in relation to nucleic
acid sequences, the term "homology" refers to the degree of similarity between two or
more nucleic acid sequences (e.g., genes) or fragments thereof. Typically, the degree of
similarity between two or more nucleic acid sequences refers to the degree of similarity of
the composition, order, or arrangement of two or more nucleotide bases (or other
genotypic feature) of the two or more nucleic acid sequences. The term "homologous
nucleic acids" generally refers to nucleic acids comprising nucleotide sequences having a
degree of similarity in nucleotide base composition, arrangement, or order. The two or
more nucleic acids may be of the same or different species or group. The term "percent
homology" when used in relation to nucleic acid sequences, refers generally to a percent
degree of similarity between the nucleotide sequences of two or more nucleic acids.

When used in relation to polypeptide (or protein) sequences, the term
"homology" refers to the degree of similarity between two or more polypeptide (or
protein) sequences (e.g., genes) or fragments thereof. Typically, the degree of similarity
between two or more polypeptide (or protein) sequences refers to the degree of similarity
of the composition, order, or arrangement of two or more amino acids of the two or more
polypeptides (or proteins). The two or more polypeptides (or proteins) may be of the same
or different species or group. The term "percent homology" when used in relation to
polypeptide (or protein) sequences, refers generally to a percent degree of similarity
between the amino acid sequences of two or more polypeptide (or protein) sequences. The
term "homologous polypeptides" or "homologous proteins" generally refers to
polypeptides or proteins, respectively, that have amino acid sequences and functions that
are similar. Such homologous polypeptides or proteins may be related by having amino
acid sequences and functions that are similar, but are derived or evolved from different or
the same species using the techniques described herein.
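By way of non-limiting illustration, one simple way to express a percent degree of similarity between two sequences is percent identity over aligned positions. This is only one of several possible measures and is an assumption of this sketch rather than a definition taken from the specification; the function name `percent_identity` and the example strings are hypothetical, and the two inputs are assumed to be pre-aligned to equal length with "-" marking gaps.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned, non-gap positions at which both sequences agree."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # positions with a gap in either sequence are skipped
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Two hypothetical 10-residue fragments that differ at positions 5 and 10.
    print(percent_identity("CDLPQTHSLG", "CDLPETHSLD"))  # prints 80.0
```

The same function applies to nucleotide strings, since it only compares characters position by position.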

The term "subject" as used herein includes, but is not limited to, an
organism; a mammal, including, e.g., a human, non-human primate (e.g., monkey), mouse,
pig, cow, goat, rabbit, rat, guinea pig, hamster, horse, monkey, sheep, or other non-human
mammal; a non-mammal, including, e.g., a non-mammalian vertebrate, such as a bird
(e.g., a chicken or duck) or a fish; and a non-mammalian invertebrate.

The term "pharmaceutical composition" means a composition suitable for
pharmaceutical use in a subject, including an animal or human. A pharmaceutical
composition generally comprises an effective amount of an active agent and a
pharmaceutically acceptable carrier.

The term "effective amount" means a dosage or amount sufficient to
produce a desired result. The desired result may comprise an objective or subjective
improvement in the recipient of the dosage or amount.

A "prophylactic treatment" is a treatment administered to a subject who
does not display signs or symptoms of a disease, pathology, or medical disorder, or
displays only early signs or symptoms of a disease, pathology, or disorder, such that
treatment is administered for the purpose of diminishing, preventing, or decreasing the risk
of developing the disease, pathology, or medical disorder. A prophylactic treatment
functions as a preventative treatment against a disease or disorder. A "prophylactic
activity" is an activity of an agent, such as a nucleic acid, vector, gene, polypeptide,
protein, substance, or composition thereof that, when administered to a subject who does not
display signs or symptoms of pathology, disease or disorder, or who displays only early
signs or symptoms of pathology, disease, or disorder, diminishes, prevents, or decreases
the risk of the subject developing a pathology, disease, or disorder. A "prophylactically
useful" agent or compound (e.g., nucleic acid or polypeptide) refers to an agent or
compound that is useful in diminishing, preventing, treating, or decreasing development of
pathology, disease or disorder.

A "therapeutic treatment" is a treatment administered to a subject who
displays symptoms or signs of pathology, disease, or disorder, in which treatment is
administered to the subject for the purpose of diminishing or eliminating those signs or
symptoms of pathology, disease, or disorder. A "therapeutic activity" is an activity of an
agent, such as a nucleic acid, vector, gene, polypeptide, protein, substance, or composition
thereof, that eliminates or diminishes signs or symptoms of pathology, disease or disorder
when administered to a subject suffering from such signs or symptoms. A
"therapeutically useful" agent or compound (e.g., nucleic acid or polypeptide) indicates
that an agent or compound is useful in diminishing, treating, or eliminating such signs or
symptoms of a pathology, disease or disorder.

The term "gene" broadly refers to any segment of DNA associated with a
biological function. Genes include coding sequences and/or regulatory sequences required
for their expression. Genes also include non-expressed DNA nucleic acid segments that,
e.g., form recognition sequences for other proteins.

Generally, the nomenclature used hereafter and the laboratory procedures
in cell culture, molecular genetics, molecular biology, nucleic acid chemistry, and protein
chemistry described below are those well known and commonly employed by those of
ordinary skill in the art. Standard techniques, such as described in Sambrook et al.,
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor
Laboratory, Cold Spring Harbor, New York, 1989 (hereinafter "Sambrook") and Current
Protocols in Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint
venture between Greene Publishing Associates, Inc. and John Wiley & Sons, Inc.
(supplemented through 1999) (hereinafter "Ausubel"), are used for recombinant nucleic
acid methods, nucleic acid synthesis, cell culture methods, and transgene incorporation,
e.g., electroporation, injection, and lipofection. Generally, oligonucleotide synthesis and
purification steps are performed according to specifications. The techniques and
procedures are generally performed according to conventional methods in the art and
various general references which are provided throughout this document. The procedures
therein are believed to be well known to those of ordinary skill in the art and are provided
for the convenience of the reader.

A variety of additional terms are defined or otherwise characterized herein. 

POLYNUCLEOTIDES OF THE INVENTION 

Interferon-alpha Homologue Sequences

The invention provides isolated or recombinant interferon-alpha homologue
polypeptides, and isolated or recombinant polynucleotides encoding the polypeptides.

As described in more detail below, in accordance with the present
invention, polynucleotide sequences which encode novel interferon-alpha homologue
polypeptides, nucleotide sequences (e.g., subsequences) that encode fragments of
interferon-alpha homologue polypeptides, and nucleotide sequences that encode related
fusion polypeptides or proteins, or functional equivalents thereof, are collectively referred
to herein as "interferon-alpha homologues," "interferon homologue nucleic acids,"
"IFN-alpha homologues," "IFN homologues," "IFN nucleic acids," "interferon homologues,"
"interferon nucleic acids," "recombinant interferon-alpha," "recombinant interferon-alpha
nucleic acids," "nucleic acids of the invention," "polynucleotides of the invention," or
"nucleotides of the invention." Polynucleotide, nucleotide, or nucleic acid fragments of
each of the preceding terms are also intended to be included and encompassed in
polynucleotides, nucleotides, and nucleic acids of the invention. The term "nucleic acid"
is used interchangeably with the term "nucleotide."

Polynucleotides encoding the polypeptides of the invention were
discovered in libraries of shuffled interferon-alpha related sequences. The library
members were screened for antiproliferative activity against human tumor cell lines and,
in some cases, assayed for antiviral activity against virus-infected human cells. A subset
of the sequences provided herein were discovered in shuffled libraries screened for
antiviral activity against virus-infected mouse cells. Coding sequences for interferon
homologues were identified as described in the examples.

Briefly, libraries of shuffled mature interferon-alpha coding sequences were
introduced into E. coli. Colonies were screened in a high-throughput antiproliferative
activity assay against a human Daudi tumor cell line as described in Example 1, and
colonies expressing active polypeptides were selected, re-screened, and expression levels
determined. DNA from selected colonies was isolated and re-shuffled to create secondary
libraries. The secondary libraries were introduced into E. coli and screened for
antiproliferative activity in the human Daudi cell line-based cell proliferation assay. DNA
from colonies selected from the primary and secondary library screens was transduced
into Chinese hamster ovary (CHO) cells, and stable cell lines were generated.
CHO-expressed proteins were purified, quantitated, and assayed for antiproliferative activity
using the human Daudi cell line, and optionally, for antiviral activity using
encephalomyocarditis virus (EMCV)-infected human WISH cells, as described in
Example 1. Exemplary shuffled nucleic acids which encode interferon-alpha homologue
polypeptides having antiproliferative activity in the human Daudi cell line-based assay are
identified herein as SEQ ID NO:1 to SEQ ID NO:35, which encode mature
interferon-alpha homologue polypeptides identified herein as SEQ ID NO:36 to SEQ ID NO:70,
respectively. Libraries of shuffled mature interferon-alpha coding sequences were also
screened in a high-throughput antiviral activity screen against EMCV-infected mouse
cells. Exemplary shuffled nucleic acids which encode polypeptides having antiviral
activity in the murine cell/EMCV-based assay are identified herein as SEQ ID NO:72 to
SEQ ID NO:78, which encode mature interferon homologue polypeptides identified
herein as SEQ ID NO:79 to SEQ ID NO:85.
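By way of non-limiting illustration, the pairing between nucleic acid and polypeptide sequence identifiers can be sketched as a lookup. The sketch assumes that the pairing runs in order within each range (SEQ ID NO:1 with SEQ ID NO:36 through SEQ ID NO:35 with SEQ ID NO:70, and likewise SEQ ID NO:72 with SEQ ID NO:79 through SEQ ID NO:78 with SEQ ID NO:85); the function name `encoded_polypeptide_seq_id` is hypothetical.

```python
def encoded_polypeptide_seq_id(nucleic_acid_seq_id: int) -> int:
    """Return the SEQ ID NO of the polypeptide paired with a nucleic acid SEQ ID NO.

    Assumption: identifiers pair in order within each range.
    """
    if 1 <= nucleic_acid_seq_id <= 35:
        # Daudi-screened set: SEQ ID NO:1-35 pair with SEQ ID NO:36-70.
        return nucleic_acid_seq_id + 35
    if 72 <= nucleic_acid_seq_id <= 78:
        # Murine cell/EMCV-screened set: SEQ ID NO:72-78 pair with SEQ ID NO:79-85.
        return nucleic_acid_seq_id + 7
    raise ValueError(f"SEQ ID NO:{nucleic_acid_seq_id} is not a listed nucleic acid")


if __name__ == "__main__":
    print(encoded_polypeptide_seq_id(1))   # prints 36
    print(encoded_polypeptide_seq_id(78))  # prints 85
```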

In another aspect, the invention provides an isolated or recombinant nucleic
acid that comprises a polynucleotide sequence selected from the group of: (a) SEQ ID
NO:1 to SEQ ID NO:35, or a complementary polynucleotide sequence thereof; (b) a
polynucleotide sequence encoding a polypeptide selected from SEQ ID NO:36 to SEQ ID
NO:71, or a complementary polynucleotide sequence thereof; (c) a polynucleotide
sequence which hybridizes under at least stringent or at least highly stringent hybridization
conditions (or ultra-high stringent or ultra-ultra-high stringent hybridization conditions)
over substantially the entire length of polynucleotide sequence (a) or (b), or with a 50,
120, 130, 140, 145, 150, 155, 160, or 165 nucleotide base subsequence or fragment of a
polynucleotide sequence of (a) or (b); and (d) a polynucleotide sequence comprising a
fragment of (a), (b), or (c), which fragment encodes all or a part of a polypeptide having
an antiproliferative activity in a human Daudi cell line-based assay or an antiviral activity
in an assay known in the art for measuring antiviral activity.

In another aspect, the invention provides an isolated or recombinant nucleic
acid that comprises a polynucleotide sequence selected from the group of: (a) SEQ ID
NO:72 to SEQ ID NO:78, or a complementary polynucleotide sequence thereof; (b) a
polynucleotide sequence encoding a polypeptide selected from SEQ ID NO:79 to SEQ ID
NO:85, or a complementary polynucleotide sequence thereof; (c) a polynucleotide
sequence which hybridizes under at least stringent or at least highly stringent hybridization
conditions (or ultra-high stringent or ultra-ultra-high stringent hybridization conditions)
over substantially the entire length of polynucleotide sequence (a) or (b), or with a 50,
120, 130, 140, 145, 150, 155, 160, or 165 nucleotide base subsequence or fragment of a
polynucleotide sequence of (a) or (b); and (d) a polynucleotide sequence comprising a
fragment of (a), (b), or (c), which fragment encodes all or a part of a polypeptide having
an antiproliferative activity in a human Daudi cell line-based assay or an antiviral activity
in a murine cell line/EMCV-based assay.

The present invention also includes a mature interferon-alpha homologue
polypeptide comprising the amino acid sequence identified herein as SEQ ID NO:71 and a
polynucleotide sequence encoding said polypeptide or a fragment of said polypeptide
having an antiproliferative activity in the human Daudi cell line-based assay and/or an
antiviral activity in the murine cell/EMCV-based assay.

The invention also includes an isolated or recombinant nucleic acid
comprising a polynucleotide sequence encoding a polypeptide, wherein the polypeptide
comprises the amino acid sequence: CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-
X22-R-X24-S-X26-FSCLKDR-X34-DFG-X38-P-X40-EEFD-X45-X46-X47-FQ-X50-X51-QA
X55-X56-X57-HE-X60-X61-QQTFN-X67-FSTK-X72-SS-X75-X76-W-X78-X79-X80-LL-X83-K-
X85-X86-T-X88-L-X90-QQLN-X95-LEACV-X101-Q-X103-V-X105-X106-X107-X108-T^^
X114-D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-
WEVVRAEIMRSFSFSTNLQKRLRRKE, or a conservatively substituted variation
thereof, where X11 is N or D; X12 is R, S, or K; X15 is L or M; X16 is I, M, or V; X19 is A or
G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is H, Y or Q; X38 is F or L; X40 is Q or R;
X45 is G or S; X46 is N or H; X47 is Q or R; X50 is K or R; X51 is A or T; X55 is S or F; X56
is V or A; X57 is L or F; X60 is M or I; X61 is I or M; X67 is L or F; X72 is D or N; X75 is A
or V; X76 is A or T; X78 is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is E or D; X85 is
F or L; X86 is S or Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M, or V;
X103 is E or G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or G; X114 is V,
E, or G; X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K or R; and
X140 is A or S. Each of the single letters of this amino acid sequence represents a
particular amino acid residue according to standard practice known to those of ordinary
skill in the art. Such polypeptides have an antiproliferative activity in the human Daudi
cell line-based assay (e.g., at least about 8.3x10^ units/mg) and/or an antiviral activity in
a human WISH cell/EMCV-based assay (at least about 2.1x10^ units/mg).
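By way of non-limiting illustration, a sequence formula of this kind can be checked against a candidate polypeptide string by converting it to a regular expression. The sketch below covers only positions 1 to 26 of the formula, reads residues 17 and 18 as LL (an assumption, since the characters between X16 and X19 are garbled in the text), and does not attempt to model "conservatively substituted variations." The names `pattern_to_regex`, `N_TERMINAL_PATTERN`, and `matches_n_terminal_pattern`, and the candidate strings, are hypothetical.

```python
import re
from typing import Dict

# Positions 1-26 of the formula; residues 17-18 are assumed to read "LL".
N_TERMINAL_PATTERN = "CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26"

# Residues permitted at each variable position, as listed after the formula.
ALLOWED: Dict[str, str] = {
    "X11": "ND", "X12": "RSK", "X15": "LM", "X16": "IMV",
    "X19": "AG", "X22": "GR", "X24": "IT", "X26": "PH",
}


def pattern_to_regex(pattern: str, allowed: Dict[str, str]) -> "re.Pattern[str]":
    """Turn a hyphen-delimited formula into a compiled regular expression."""
    parts = []
    for token in pattern.split("-"):
        if token in allowed:
            parts.append("[" + allowed[token] + "]")  # variable position
        else:
            parts.append(re.escape(token))            # fixed residues
    return re.compile("".join(parts))


N_TERMINAL_REGEX = pattern_to_regex(N_TERMINAL_PATTERN, ALLOWED)


def matches_n_terminal_pattern(polypeptide: str) -> bool:
    """True if the polypeptide begins with residues satisfying positions 1-26."""
    return N_TERMINAL_REGEX.match(polypeptide) is not None


if __name__ == "__main__":
    print(matches_n_terminal_pattern("CDLPQTHSLGNRRALILLAQMGRISP"))  # prints True
    print(matches_n_terminal_pattern("CDLPQTHSLGARRALILLAQMGRISP"))  # prints False
```

The second candidate fails because it carries A at position 11, where only N or D is listed.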

As described in greater detail below, the polynucleotides of the invention
are useful for a variety of applications, including, but not limited to, as therapeutic and
prophylactic agents in methods of in vivo and ex vivo treatment of a variety of diseases,
disorders, and conditions in a variety of subjects; for use in in vitro methods, such as
diagnostic methods, to detect, diagnose, and treat a variety of diseases, disorders, and
conditions in a variety of subjects; for use in, e.g., gene therapy; as therapeutics and
prophylactics, e.g., for use in methods of therapeutic and prophylactic treatment of a
disease, disorder or condition; as immunogens; for use in diagnostic and screening assays;
and as diagnostic probes for the presence of complementary or partially complementary
nucleic acids (including for detection of IFN-alpha coding nucleic acids).

Making Polynucleotides of the Invention 

Polynucleotides and oligonucleotides of the invention can be prepared by
standard solid-phase methods, according to known synthetic methods. Typically,
fragments of up to about 20, 30, 40, 50, 60, 70, 80, 90, and/or 100 nucleotide bases are
individually synthesized, then joined (e.g., by enzymatic or chemical ligation methods, or
polymerase mediated recombination methods) to form essentially any desired continuous
sequence. In another aspect, nucleotide fragments of greater than 100 nucleotide bases
(e.g., 150, 180, 200, 210, 240, 270, 300, 330, 360, 390, 400, 420, 450, 465, 474, 470, 475,
489, 490, 495, 496 bases) are individually synthesized, then joined (e.g., by enzymatic or
chemical ligation methods, or polymerase mediated recombination methods) to form
essentially any desired continuous sequence. For example, the polynucleotides and
oligonucleotides of the invention, including fragments thereof (and those as described
herein), can be prepared by chemical synthesis using, e.g., the classical phosphoramidite
method described by Beaucage et al. (1981) Tetrahedron Letters 22:1859-69, or the
method described by Matthes et al. (1984) EMBO J. 3:801-05, e.g., as is typically
practiced in automated synthetic methods. According to the phosphoramidite method,
oligonucleotides are synthesized, e.g., in an automatic DNA synthesizer, purified,
annealed, ligated and cloned in appropriate vectors.
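By way of non-limiting illustration, the idea of individually synthesizing fragments of up to about 100 nucleotide bases and joining them into a continuous sequence can be sketched on character strings. This is a simplification offered under stated assumptions: real assembly by ligation or polymerase-mediated methods generally relies on complementary or overlapping ends, which this sketch does not model. The function names and the example sequence are hypothetical.

```python
from typing import List


def split_into_fragments(sequence: str, max_len: int = 100) -> List[str]:
    """Split a sequence into consecutive fragments of at most max_len bases."""
    if max_len < 1:
        raise ValueError("max_len must be at least 1")
    return [sequence[i:i + max_len] for i in range(0, len(sequence), max_len)]


def join_fragments(fragments: List[str]) -> str:
    """Join fragments end to end, in order, into one continuous sequence."""
    return "".join(fragments)


if __name__ == "__main__":
    target = "ACGTA" * 50  # hypothetical 250-base target sequence
    fragments = split_into_fragments(target, max_len=100)
    print([len(f) for f in fragments])          # prints [100, 100, 50]
    print(join_fragments(fragments) == target)  # prints True
```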

In addition, essentially any nucleic acid can be custom ordered from any of
a variety of commercial sources, such as The Midland Certified Reagent Company
(mcrc@oligos.com), The Great American Gene Company (http://www.genco.com),
ExpressGen Inc. (www.expressgen.com), Operon Technologies Inc. (Alameda, CA) and
many others. Similarly, peptides and antibodies can be custom ordered from any of a
variety of sources, such as PeptidoGenic (pkim@ccnet.com), HTI Bio-products, Inc.
(http://www.htibio.com), BMA Biomedicals Ltd. (U.K.), Bio.Synthesis, Inc., and many
others.

Certain polynucleotides of the invention may also be obtained by screening
cDNA libraries (e.g., libraries generated by recombining homologous nucleic acids as in
typical diversity generation methods, such as, e.g., shuffling methods) using
oligonucleotide probes which can hybridize to or PCR-amplify polynucleotides which
encode the interferon homologue polypeptides and fragments of those polypeptides.
Procedures for screening and isolating cDNA clones are well-known to those of skill in
the art. Such techniques are described in, for example, Sambrook et al., Molecular
Cloning - A Laboratory Manual (2nd Ed.), Vols. 1-3, Cold Spring Harbor Laboratory,
Cold Spring Harbor, New York, 1989 (hereinafter "Sambrook") and Current Protocols in
Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (supplemented through
1999) (hereinafter "Ausubel").

As described in more detail herein, the polynucleotides of the invention
include sequences which encode novel mature interferon-alpha homologues and sequences
complementary to the coding sequences, and novel fragments of such coding sequences
and complements thereof. The polynucleotides can be in the form of RNA or in the form
of DNA, and include mRNA, cRNA, synthetic RNA and DNA, and cDNA. The
polynucleotides can be double-stranded or single-stranded, and if single-stranded, can be
the coding strand or the non-coding (anti-sense, complementary) strand. The
polynucleotides optionally include the coding sequence of an interferon-alpha homologue
(i) in isolation, (ii) in combination with additional coding sequence, so as to encode, e.g., a
fusion protein, a pre-protein, a prepro-protein, or the like, (iii) in combination with
non-coding sequences, such as introns, control elements such as a promoter, a terminator
element, or 5' and/or 3' untranslated regions effective for expression of the coding
sequence in a suitable host, and/or (iv) in a vector or host environment in which the
interferon-alpha homologue coding sequence is a heterologous nucleic acid sequence or
gene. Sequences can also be found in combination with typical compositional
formulations of nucleic acids, including in the presence of carriers, buffers, adjuvants,
excipients and the like.

The term DNA or RNA encoding the respective interferon-alpha
homologue polypeptide includes any oligodeoxynucleotide or oligodeoxyribonucleotide
sequence which, upon expression in an appropriate host cell, results in production of an
interferon-alpha homologue polypeptide of the invention. The DNA or RNA can be
produced in an appropriate host cell, or in a cell-free (in vitro) system, or can be produced
synthetically (e.g., by an amplification technique such as PCR) or chemically.

Using Polynucleotides of the Invention 

The polynucleotides of the invention have a variety of uses in, for example: 
recombinant production (i.e., expression) of the interferon-alpha homologue polypeptides 
of the invention; as therapeutics and prophylactics, e.g., for use in methods of therapeutic 
and prophylactic treatment of a disease, disorder or condition; for use in gene therapy 
methods and related applications; as immunogens; for use in diagnostic and screening 
assays; as diagnostic probes for the presence of complementary or partially 
complementary nucleic acids (including for detection of natural IFN-alpha coding 
nucleic acids); as substrates for further reactions, e.g., shuffling reactions or mutation 
reactions to produce new and/or improved IFN-alpha homologues; and the like. 

EXPRESSION OF POLYPEPTIDES 

In accordance with the present invention, the novel and/or mature 
interferon-alpha homologues, fragments of interferon-alpha proteins, related fusion 
proteins, and functional equivalents thereof that are encoded by the polynucleotide 
sequences of the invention are collectively referred to herein as "interferon-alpha 
homologue polypeptides," "interferon-alpha homologue proteins," "interferon-alpha 
homologues," "interferon homologues," "IFN-alpha homologues," "IFN homologues," 
"IFN polypeptides," "IFN proteins," "polypeptides of the invention," or "proteins of the 
invention." Polypeptide or amino acid fragments of each of the preceding terms are also 
intended to be included and encompassed in the polypeptides or proteins of the invention. 
The polynucleotide sequences of the invention are used in recombinant DNA (or RNA) 
molecules that direct the expression of the interferon-alpha homologue polypeptides in 
appropriate host cells. Due to the inherent degeneracy of the genetic code, other nucleic 
acid sequences which encode substantially the same or a functionally equivalent amino 
acid sequence are also used to clone and express the interferon homologues. 

Modified Coding Sequences 

As will be understood by those of skill in the art, it can be advantageous to 
modify a coding sequence (including, e.g., a nucleotide sequence encoding an 
interferon-alpha homologue of the invention or a fragment thereof) to enhance its 
expression in a particular host. The genetic code is redundant with 64 possible codons, 
but most organisms preferentially use a subset of these codons. The codons that are 
utilized most often in a species are called optimal codons, and those not utilized very often 
are classified as rare or low-usage codons (see, e.g., Zhang S.P. et al. (1991) Gene 
105:61-72). Codons can be substituted to reflect the preferred codon usage of the host, a 
process called "codon optimization" or "controlling for species codon bias." 
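
The codon substitution described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the preferred-codon map (HYPOTHETICAL_PREFERRED) is invented for the example and does not represent measured codon usage for any host, and the function names are not taken from this document. The sketch re-codes each codon to the host-preferred synonym and leaves the encoded amino acid sequence unchanged.

```python
from itertools import product

# Standard genetic code (DNA alphabet) in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TO_AA = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical host preferences (illustration only, not real usage data).
HYPOTHETICAL_PREFERRED = {"C": "TGC", "D": "GAC", "L": "CTG", "P": "CCG", "Q": "CAG"}


def translate(dna):
    """Translate an in-frame DNA coding sequence to one-letter amino acids."""
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TO_AA[dna[i:i + 3]] for i in range(0, len(dna), 3))


def codon_optimize(dna, preferred):
    """Replace each codon with the preferred synonymous codon, if one is given."""
    if len(dna) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        aa = CODON_TO_AA[codon]
        new = preferred.get(aa, codon)  # keep the codon if no preference is listed
        if CODON_TO_AA[new] != aa:
            raise ValueError(f"{new} does not encode {aa}")
        out.append(new)
    return "".join(out)
```

For example, codon_optimize("TGTGATTTACCACAA", HYPOTHETICAL_PREFERRED) changes every codon of the input while translate() of the input and of the output gives the same peptide.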

Optimized coding sequence containing codons preferred by a particular 
prokaryotic or eukaryotic host (see also Murray, E. et al. (1989) Nuc. Acids Res. 
17:477-508) can be prepared, for example, to increase the rate of translation or to produce 
recombinant RNA transcripts having desirable properties, such as a longer half-life, as 
compared with transcripts produced from a non-optimized sequence. Translation stop 
codons can also be modified to reflect host preference. For example, preferred stop 
codons for S. cerevisiae and mammals are UAA and UGA, respectively. The preferred 
stop codon for monocotyledonous plants is UGA, whereas insects and E. coli prefer to use 
UAA as the stop codon (Dalphin M.E. et al. (1996) Nuc. Acids Res. 24:216-218). 
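
The stop codon preferences listed above can be captured as a small lookup table. The following Python sketch is illustrative only: the host keys and the function name are hypothetical labels chosen for the example, and the table contains nothing beyond the preferences stated in the preceding paragraph.

```python
# Preferred stop codons by host, as stated in the paragraph above.
# The dictionary keys are hypothetical labels chosen for this example.
PREFERRED_STOP = {
    "s_cerevisiae": "UAA",
    "mammal": "UGA",
    "monocot_plant": "UGA",
    "insect": "UAA",
    "e_coli": "UAA",
}

STOP_CODONS = {"UAA", "UAG", "UGA"}


def set_preferred_stop(mrna, host):
    """Return the in-frame mRNA with its terminal stop codon swapped for the host's preferred one."""
    if len(mrna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    if mrna[-3:] not in STOP_CODONS:
        raise ValueError("sequence does not end in a stop codon")
    return mrna[:-3] + PREFERRED_STOP[host]
```

Only the terminal codon is changed, so the encoded polypeptide is unaffected.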

The polynucleotide sequences of the present invention can be engineered in 
order to alter an interferon homologue coding sequence for a variety of reasons, including 
but not limited to, alterations which modify the cloning, processing and/or expression of 
the gene product. For example, alterations may be introduced using techniques which are 
well known in the art, e.g., site-directed mutagenesis, to insert new restriction sites, to alter 
glycosylation patterns, to change codon preference, to introduce splice sites, etc. 

Vectors, Promoters and Expression Systems 
The present invention also includes recombinant constructs comprising one 
or more of the nucleic acid sequences as broadly described herein (e.g., those encoding an 
interferon-alpha homologue of the invention or a fragment thereof). The constructs 
comprise a vector, such as a plasmid, a cosmid, a phage, a virus (including a retrovirus), a 
bacterial artificial chromosome (BAC), a yeast artificial chromosome (YAC), and the like, 
into which a nucleic acid sequence of the invention has been inserted, in a forward or 
reverse orientation. In a preferred aspect of this embodiment, the construct further 
comprises regulatory sequences, including, for example, a promoter, operably linked to the 
sequence. Large numbers of suitable vectors and promoters are known to those of skill in 
the art, and are commercially available. 

General texts which describe molecular biological techniques useful herein, 
including the use of vectors, promoters and many other relevant topics, include Juo, P-S., 
Concise Dictionary of Biomedical and Molecular Biology (CRC Press 1996); 
Singleton et al., Dictionary of Microbiology and Molecular Biology (2d ed. 1994); 
The Cambridge Dictionary of Science and Technology (Walker ed., 1988); Hale & 
Marham, The Harper Collins Dictionary of Biology (1991); Scott and Mercer, 
Concise Encyclopedia of Biochemistry and Molecular Biology (3d ed. 1997); 
Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology, 
volume 152, Academic Press, Inc., San Diego, CA (hereinafter "Berger"); Sambrook et al., 
Molecular Cloning - A Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, New York, 1989 ("Sambrook"); and Current Protocols in 
Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between 
Greene Publishing Associates, Inc. and John Wiley & Sons, Inc. (supplemented through 
1999) ("Ausubel"). Examples of techniques sufficient to direct persons of skill through 
in vitro amplification methods, including the polymerase chain reaction (PCR), the ligase 
chain reaction (LCR), Qβ-replicase amplification and other RNA polymerase mediated 
techniques (e.g., NASBA), e.g., for the production of the homologous nucleic acids of the 
invention, are found in Berger, Sambrook, and Ausubel, as well as Mullis et al. (1987) U.S. 
Patent No. 4,683,202; U.S. Pat. No. 4,683,195, issued July 28, 1987; PCR Protocols: A 
Guide to Methods and Applications (Innis et al., eds.) Academic Press Inc. San Diego, CA 
(1990) (Innis); Arnheim & Levinson (October 1, 1990) C&EN 36-47; The Journal Of 
NIH Research (1991) 3, 81-94; Kwoh et al. (1989) Proc. Nat'l Acad. Sci. USA 86, 1173; 
Guatelli et al. (1990) Proc. Nat'l Acad. Sci. USA 87, 1874; Lomell et al. (1989) J. Clin. 
Chem. 35, 1826; Landegren et al. (1988) Science 241, 1077-1080; Van Brunt (1990) 
Biotechnology 8, 291-294; Wu and Wallace (1989) Gene 4, 560; Barringer et al. (1990) 
Gene 89, 117; and Sooknanan and Malek (1995) Biotechnology 13:563-564. 

PCR generally refers to a procedure wherein minute amounts of a specific 
piece of nucleic acid, RNA, and/or DNA, are amplified by methods well known in the art 
(see, e.g., U.S. Pat. No. 4,683,195 and other references above). Generally, sequence 
information from the ends of the region of interest or beyond is used for design of 
oligonucleotide primers. Such primers will be identical or similar in sequence to the 
opposite strands of the template to be amplified. The 5' terminal nucleotides of the 
opposite strands may coincide with the ends of the amplified material. PCR may be used 
to amplify specific RNA or specific DNA sequences, recombinant DNA or RNA 
sequences, DNA and RNA sequences from total genomic DNA, and cDNA transcribed 
from total cellular RNA, bacteriophage or plasmid sequences, etc. PCR is one example, 
but not the only example, of a nucleic acid polymerase reaction method for amplifying a 
nucleic acid test sample comprising the use of another (e.g., known) nucleic acid as a 
primer. Improved methods of cloning in vitro amplified nucleic acids are described in 
Wallace et al., U.S. Pat. No. 5,426,039. Improved methods of amplifying large nucleic 
acids by PCR are summarized in Cheng et al. (1994) Nature 369:684-685 and the 
references therein, in which PCR amplicons of up to 40 kb are generated. One of skill will 
appreciate that essentially any RNA can be converted into a double stranded DNA suitable 
for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a 
polymerase. See Ausubel, Sambrook and Berger, all supra. 
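
As an illustration of how primers relate to the ends of the region of interest, the following Python sketch picks a forward primer identical to the start of the region on the given strand and a reverse primer that is the reverse complement of the end of the region. It is a minimal sketch under simplifying assumptions: the function names, the default primer length and the example template are hypothetical, and practical primer design also considers melting temperature, secondary structure and specificity, which are not modeled here.

```python
# Map each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5'->3')."""
    return seq.translate(COMPLEMENT)[::-1]


def design_primers(template, start, end, length=18):
    """Pick primers flanking template[start:end] (0-based, end exclusive).

    The forward primer matches the first `length` bases of the region on the
    given strand; the reverse primer is the reverse complement of the last
    `length` bases, so it matches the opposite strand. Both are written 5'->3'.
    """
    region = template[start:end]
    if len(region) < length:
        raise ValueError("region is shorter than the requested primer length")
    forward = region[:length]
    reverse = reverse_complement(region[-length:])
    return forward, reverse
```

With the hypothetical template "GGATCCATGAAACTGCAGTAAGAATTC", design_primers(template, 6, 21, length=6) selects the region ATGAAACTGCAGTAA and returns a six-base primer for each end of it.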

The present invention also relates to host cells which are transduced with 
vectors of the invention, and the production of polypeptides of the invention (including 
fragments thereof) by recombinant techniques. Host cells are genetically engineered (i.e., 
transduced, transformed or transfected) with the vectors of this invention, which may be, 
for example, a cloning vector or an expression vector. The vector may be, for example, in 
the form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be 
cultured in conventional nutrient media modified as appropriate for activating promoters, 
selecting transformants, or amplifying the interferon homologue gene. The culture 
conditions, such as temperature, pH and the like, are those previously used with the host 
cell selected for expression, and will be apparent to those skilled in the art and in the 
references cited herein, including, e.g., Freshney (1994) Culture of Animal Cells, a 
Manual of Basic Technique, 3d ed., Wiley-Liss, New York and the references cited 
therein. 

The interferon homologue polypeptides and proteins of the invention can 
also be produced in non-animal cells such as plants, yeast, fungi, bacteria and the like. In 
addition to Sambrook, Berger and Ausubel, details regarding cell culture can be found in 
Payne et al. (1992) Plant Cell and Tissue Culture in Liquid Systems, John Wiley & Sons, 
Inc. New York, NY; Gamborg and Phillips (eds.) (1995) Plant Cell, Tissue and Organ 
Culture: Fundamental Methods, Springer Lab Manual, Springer-Verlag (Berlin 
Heidelberg New York) and Atlas and Parks (eds.) The Handbook of Microbiological 
Media (1993) CRC Press, Boca Raton, FL. 

The polynucleotides of the present invention may be included in any one of 
a variety of expression vectors for expressing a polypeptide. Such vectors include 
chromosomal, nonchromosomal and synthetic DNA sequences, e.g., derivatives of SV40; 
bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors derived from 
combinations of plasmids and phage DNA; viral DNA such as vaccinia, adenovirus, fowl 
pox virus, pseudorabies, adeno-associated virus, retroviruses and many others. 
Any vector that transduces genetic material into a cell, and, if replication is desired, 
which is replicable and viable in the relevant host can be used. 

The nucleic acid sequence in the expression vector is operatively linked to 
an appropriate transcription control sequence (promoter) to direct mRNA synthesis. 
Examples of such promoters include: LTR or SV40 promoter, E. coli lac or trp promoter, 
phage lambda PL promoter, and other promoters known to control expression of genes in 
prokaryotic or eukaryotic cells or their viruses. The expression vector also contains a 
ribosome binding site for translation initiation, and a transcription terminator. The vector 
optionally includes appropriate sequences for amplifying expression. In addition, the 
expression vectors optionally comprise one or more selectable marker genes to provide a 
phenotypic trait for selection of transformed host cells, such as dihydrofolate reductase or 
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin 
resistance in E. coli. 

The vector containing the appropriate DNA sequence as described herein, 
as well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. Examples of appropriate 
expression hosts include: bacterial cells, such as E. coli, Streptomyces, and Salmonella 
typhimurium; fungal cells, such as Saccharomyces cerevisiae, Pichia pastoris, and 
Neurospora crassa; insect cells such as Drosophila and Spodoptera frugiperda; 
mammalian cells such as CHO, COS, BHK, HEK 293 or Bowes melanoma; plant cells, 
etc. It is understood that not all cells or cell lines need to be capable of producing fully 
functional interferon homologues; for example, antigenic fragments of an interferon 
homologue may be produced in a bacterial or other expression system. The invention is 
not limited by the host cells employed. 

In bacterial systems, a number of expression vectors may be selected 
depending upon the use intended for the interferon homologue. For example, when large 
quantities of interferon homologue or fragments thereof are needed for the induction of 
antibodies, vectors which direct high level expression of fusion proteins that are readily 
purified may be desirable. Such vectors include, but are not limited to, multifunctional 
E. coli cloning and expression vectors such as BLUESCRIPT (Stratagene), in which the 
interferon homologue coding sequence may be ligated into the vector in-frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of beta-galactosidase 
so that a hybrid protein is produced; pIN vectors (Van Heeke & Schuster (1989) J. Biol. 
Chem. 264:5503-5509); pET vectors (Novagen, Madison WI); and the like. 

Similarly, in the yeast Saccharomyces cerevisiae a number of vectors 
containing constitutive or inducible promoters such as alpha factor, alcohol oxidase and 
PGH may be used for production of the interferon homologue proteins of the invention. 
For reviews, see Ausubel et al. (supra) and Grant et al. (1987; Methods in Enzymology 
153:516-544). 

In mammalian host cells, a number of expression systems, such as viral-based 
systems, may be utilized. In cases where an adenovirus is used as an expression vector, a 
coding sequence is optionally ligated into an adenovirus transcription/translation complex 
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(,993) CRC Press, Boca Ra.on. ?L. A** ^^^^^^^^ ^„„„„ c.mioS- 

available conrmercial l,.era,ure such as .be U^S 
,«fro,nS,gn,a-.dricMnc<..^^.^^^^^^ 
C„l.»r.C«»lo8"e and supplemem (1997) 

30 ("Sigma-PCCS"). ,1™, can be recovered and purified from 

Polypcp.ides of .be .nvenbon can be 

^„„binan.cellc„l.uresbyanyo,anumberofme.bodswe 
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Chromatography. afOni.y chromatography us,ng any ° 

steps canoe us u„ /HPLCUan be employed in the final 

high performance liquid chromatography (HPLC) P > . ^^j^^ 

;Ia.>„„.ep...aa.,o„.o,h=r„« 

,,,c,ho.arc»=nK„ow„in.he— S...^ 

^•"•"?ori-or..B„g..*„ar.sa„..n^ 
PracricaApproach.iRl.f^'^'^'^ \ mL Press at Oxford, Oxford, England; 

on CD-ROM, Humana Press. NJ. 

^i^^ii^^^^^^^^^^^^ systems can also be employed to produce 

Cell-free transcnption/transiation sysicui 

r,MA, or RNM of the p««n. ■av.nUon. Several such systems are 

„„,«u/«r B,V,l<.sy. volume 37, Oariand Publishing. NY. 

25 ^ ,,cm,v he advantageous in. for example, (a) 

«.ds.^epresenceofmod,nedam,no^^^^^^ 

LJlyor^.— allyd^^^^^^^^^ 
30 glycosylation at N-X-Sn- motifs dunng expression m 

synthetic means. 
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3.,„o aC, a suUate. amino aci., '^''^^^J^;^^^ ,,,,, » pEO-y.a«d anOno a 
„.o aC, an ace.y-a.ecl an,i„o ac... an a^ .e. „a .he 

.o,.ny,a.e.a™noac>.,acaH>o.y.a.e -^^^^ 
, ,>..K.=.ncesaae,„a,e.o..a=oe^^^^^^^^^ 

po,yn.c.o,i.=s and ^.^P 

.0 „p.«slon, ot .he «co..nan, '-^'^"^^^ in vivo and « — o. a 

o, disease, disorder, and -d-- ^^^^ ^^^^^ . vane., 

^U-od., such as d,agnos.,c and scree mng ^^^^^ 
oHisea«s.diso.ders.andcona,.ions... ca ^^^^^^^ 

,5 baseddisorders) in a vanety of subjecs (e.g. .„ „ 

' lodsandDNA-orKNAWdeve..^^^^^^^^^^^ 

„,„,oHnv«.hio.osiea..yacUvepoiyP^^d.°^^^^^^ 

.„s, o^an, ^n, hodi., sys^n. o a^ n^ ^^^^^^^^^ ^^^^^^ ^ ,„ ^ 
.,.e., «c.)-. as DNA vaccines, mu .«o.P ^^^^ ^^^^^ ,^„rs. 

,0 .be.apen.ic ,rea.men. of a vane.y ^ 

,,.,-based disorder, •»^°»-"-'"^;" ,„„u„e response in a suhjec; as a 
.™n.alsV. as ad,uvanu .o enhance or au^«^^ ^ . 

..™,nen. of a mnUiple-s.ep boos..ng vacc.na..on ™ ^coding a 

;:::::,na,ionbyde,weryofaoN.«^^^ 

,3 ;.ypeP-of.heinven.ionorenc^^^^^ 

o, a polypep.,de (..g.. a Po'VpeP-de * ,o„p,emen.ar, nuc,e,c acds 

^1 for .he presence of comp'-enury or an, J ^ 
Un.forde,ec,,onofna.ora,n.r^-^;;^^^^^^ 

,„„ber reaCons, shufn.nl " .^^a homoiogues and new 

30 «ac.ions.oproducene.and.or..p.o— 

..erferon-afphanucie^ac,^^^^^^^^^^ 
or prophylactic properties, ana 
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tn identify new and/or improved 

i„v„uon, or comp.en«n.s of the P*" ,„ express a 

.eo.p«s.ar-r.peu.,canyorpropV--''V__^^._^^^^^^^ 

«„«peu«ca,>yusefu.prodoc-^.« ^;„, .^„p,>c,.de a n,uUU... o. 

or « ..vo app«c»o., "^^^^^^^ aUered m ce«s. Such ™e*o.^ 
^..ioues by wWc. «P-'° ;7 „ ,Herapeu-ica..y or 
instance, O-e introducuon o. genes .or exprc ^^^^^^^^^ ^^.^n, 

„ p^opMaCicaUy »sef».poW.^''«' " ^^.in, v.«h a rerrov,,- con,pris,ng *e 

polynucleotides and/o, polypepi.<les of ^„„3,„,, 
::pHsesad.Uona,e.ose„ous,e..>«^- 
^uences. fnoneaspecurhe— J- g^^^^^^^^^^^^^^^ 

or.erapeu,>ca,.,ueadnga*sease.*»Me 

„,,,„„,byad.nims«ri.ginv.vo.«v« 

l„,enUonaescri^Heref„'Ooneor™«ee„s^ 

..„.aUncM.g.».---;^:.„,,,,,.c..a.ra..^ 
h»s.er, horse, sheep; or a ^„ 
,0 d>.«oraf,sh,or.vene.ra,e,a3J^n^^ 

,„ another aspect, the -nvenuon p , 
^,e*aUy.rea.>.gadfsease,*sorde,or-^^^^^ 

„ea».«n. by adn^nistering v.- - ^,„,„„„, .Hose der,.«<. 

invention <.escH>»-^-"'°°"=°"""°" 

' .. ./iMnil below. 



"asa-Hhedintr^reaetalfhelo. 



BdiWBii^B^^^^n. interferon homologoe polypepudes of the 

invention are panicufariy useful for .n ^ ^^ntp.e. 

app.ic«ions.uslng.echn,<,uesweU.no»^o^^^ 
30 lredcel.sareengineerea««v^.,h P^^^ Ceffs ma, aiso he en.neered . wv. 

-rf::::::::-;-- 



10 



. vivo or ex vivo 

,„6 U.S.Pa>. NO- _ ,33., BngK» (1991) ^^^^^^^ j 

M.iSrf.298:2«-«'.N 3„,Wanga.dHuan ( ^^^^^^^^^ 

.8.. - ■ Ca,-" « ^'"'^ ^""^ TLr. Ceil Mo,. BW- 

(.995) cancer Hes. 55:1 «^ , „. 19M ^^^^^ 

:: 8 C"^' ■ ^1.65.), and many o*er 

Res. 691:76-82, Crys , ,.996) J- Neuros«rg. 85.t)^ .„ic ooWnucleoude 
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.,nl (1991) Hum. Gene T/.er. 2:215). 

0,. appropriate vec,o«. p^„cols, ' ? 

General texts «l»eh descnbe ge .^^^ p^„^„,s. 

....present — .,—t—c^^^^ 

afflSSDsei££!!aa!sM„„ „, nucleic acids of the invention as gene 

* aiso useM for sense and anti-sense 
.epiacemen. nucleic acids, the nucleic acds - ^ ^^^^ 

;p,essionofe.pression.....oaoW^^ 
Jention.oncee.pressionofthen.«^^ 
.,.„„cle,cac,dsoftheinven„on, rs^^^ 

.l^heusedtoblocvexpresstonofnamran, ^ ^^^^^^^^^„„..„„ 

!l,y of sense and anti-sense -^'-^^ , p„„ica, Appro.* ml. Press at 

Oxford university. Oxford, t 8 ^rited therein. 
HumanaPress.NJ.and.heref.rencesc,ted 

a.nii....yaiMSpepti^^ 

?i^;5^«iS»des and polyp P „, .^e invention) may 

„,,.an.i^ies.«-o-npns,n.pol.^--^^^^^^^^^ 
:e.p.o,edrortherapeu.,candpro,.y.-^^^^^ 

p^armac^tical ca^er. Such ^ f .,e invent.on, and a 

effective an«unt o, the polynudeofde or polP^P ,„ep.ab1e earner 

^^laoeuticallyacceptahlecarriero^x^-^ 
noontpassesanyofthestandardpha™^-^^^^^^^ 

.^.orexcipientinclude.^-;-^^^^^^ 
buffered saline soluoonVdext'O*- 
oil/water or water/oil emulsion), various types of wetting agents and/or adjuvants, and 
combinations thereof. Suitable pharmaceutical carriers and agents are described in 
Remington's Pharmaceutical Sciences (Mack Publishing Co., Easton, 19th ed. 1995). 
The formulation should suit the mode of administration of the active agent (e.g., 
nucleotide, polypeptide, vector, cell, etc.). Methods of administering nucleic acids, 
polypeptides, vectors, cells, antibodies, and proteins are well known in the art, and further 
discussed below. 

Use as Probes 

Also contemplated are uses of polynucleotides, also referred to herein as 
oligonucleotides, typically having at least 12 bases, preferably at least 15, more preferably 
at least 20, 30, or 50 bases, which hybridize under at least highly stringent (or ultra-high 
stringent or ultra-ultra-high stringent) conditions to an interferon homologue 
polynucleotide sequence described above. The polynucleotides may be used as probes, 
primers, sense and antisense agents, and the like, according to methods as noted supra. 

SEQUENCE VARIATIONS 

Silent Variations 

It will be appreciated by those skilled in the art that due to the degeneracy 
of the genetic code, a multitude of nucleic acid sequences encoding interferon homologue 
polypeptides of the invention may be produced, some of which may bear minimal sequence 
homology to the nucleic acid sequences explicitly disclosed herein. 
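
The size of this "multitude" follows directly from the standard genetic code: the number of distinct coding sequences for a peptide is the product of the number of synonymous codons at each position. The Python sketch below is an illustration only; the function names are hypothetical, and the pentapeptide CDLPQ is used purely as a short example input.

```python
from itertools import product
from math import prod

# Standard genetic code (DNA alphabet) in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")

# Group codons by the amino acid they encode.
CODONS_FOR = {}
for codon, aa in zip(product(BASES, repeat=3), AMINO):
    CODONS_FOR.setdefault(aa, []).append("".join(codon))


def count_encodings(peptide):
    """Number of distinct DNA sequences that encode the given peptide."""
    return prod(len(CODONS_FOR[aa]) for aa in peptide)


def silent_variants(peptide):
    """Enumerate every DNA sequence encoding the peptide (short peptides only)."""
    return ["".join(codons) for codons in product(*(CODONS_FOR[aa] for aa in peptide))]
```

For CDLPQ the count is 2 x 2 x 6 x 4 x 2 = 192; for a full-length polypeptide the count grows multiplicatively with every additional residue, so enumeration is practical only for short peptides.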
Table 1 
Codon Table 

Amino acids                  Codon 

Alanine         Ala   A      GCA GCC GCG GCU 
Cysteine        Cys   C      UGC UGU 
Aspartic acid   Asp   D      GAC GAU 
Glutamic acid   Glu   E      GAA GAG 
Phenylalanine   Phe   F      UUC UUU 
Glycine         Gly   G      GGA GGC GGG GGU 
Histidine       His   H      CAC CAU 
Isoleucine      Ile   I      AUA AUC AUU 
Lysine          Lys   K      AAA AAG 
Leucine         Leu   L      UUA UUG CUA CUC CUG CUU 
Methionine      Met   M      AUG 
Asparagine      Asn   N      AAC AAU 
Proline         Pro   P      CCA CCC CCG CCU 
Glutamine       Gln   Q      CAA CAG 
Arginine        Arg   R      AGA AGG CGA CGC CGG CGU 
Serine          Ser   S      AGC AGU UCA UCC UCG UCU 
Threonine       Thr   T      ACA ACC ACG ACU 
Valine          Val   V      GUA GUC GUG GUU 
Tryptophan      Trp   W      UGG 
Tyrosine        Tyr   Y      UAC UAU 

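For instance, inspection of the codon table (Table 1) shows that codons 
AGA, AGG, CGA, CGC, CGG, and CGU all encode the amino acid arginine. Thus, at 
every position in the nucleic acids of the invention where an arginine is specified by a 
codon, the codon can be altered to any of the corresponding codons described above 
without altering the encoded polypeptide. It is understood that U in an RNA sequence 
corresponds to T in a DNA sequence. 

Using, as an example, the nucleic acid sequence corresponding to 
nucleotides 1-15 of SEQ ID NO:1, [...], a silent variation of this 
sequence includes TGC GAC TTA CCA CAA, [...] the amino acid 
sequence CDLPQ, corresponding to amino [...]. 

Such "silent variations" are one species of "conservatively modified 
variations," discussed below. One of skill will recognize that each codon in a nucleic acid 
(except AUG, which is ordinarily the only codon for methionine) can be modified by 
standard techniques to encode a functionally identical polypeptide. [...] 
sequence. The invention provides each and every possible [...] 
combinations based on possible codon choices. [...] 
All such variations of every nucleic acid [...] consideration of the sequence in 
combination with the genetic code. 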

"""rtSe. vaHaUons" or. .mp.,, "conservarWc vana„ons' 
..Conserva,,.e,y r.^^ ^^^^ ^ 

of a particular nucleic acid sequence refers ^^^^^^ ^^.^ ^ 

„ essentially identical amino acid sequences, or^ 

.a^i^acidsequence — 

that individual substttufons, delet.ons 
.^„„.cidora3.allpe.centaseoranunoa^^^ P-^^^ 

less than 4*, 3», 2% or 1*) in an "^'^^ ^ ^„,„„ .edition of an 
...„.-«be.etheaUera.i.s.^U.n^^^^^^^^^^^ 

^ino acid, or substitution of an am.no ac d 

are well known in the art. Table 
"conservative substitutions" for one another. 

Table2 

Conservative Substitution Groups 
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T,„s -conservatively subsntutedvariations" or "conservative 

the nresent invention include 

substitutions- of a listed polyP'P'* ^"^'^ """" 
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3, 2* or of .he amino aci* of po.«..p..Oe 
.,..a..oac.of..sa..— 

For example, a conservatively noituiions" according to 

• c;FO id NO-36 will contain "conservauve substitutions . 
identified herein as SEQ ID INU.^o • . ^ about 5% of the amino 

,He Six g;oups .er.«a above, in up .o abou, S or 9 res.aue. (,... abcu 
acias) in .he 166-amino acid po,ypep..a. ,„ 

to a ,„«ter example, .f four con«^ 
,hereg.oneorresponding.oan,inoacid,es,dues 14>.166of SEQ 

„„3e„a.ivesuhs.i.u.— ^^^^ 
5 subs.i.u.ionsa.eunderl.ned) L,s,,ng P ^^^^^^ 
above subsu.ation uble, provides an express nsnnj 

proteins. ... j„ ,i,er Ihe encoded acivi.y of 

Finally ,he addition of sequences wh.ch do no. a..e 

„„,,...j:ec:;e,s„chas.eaddi.ionofanon.func,iona,se,.nce,,.a 

30 conserva.,veva,ia.,onof,hebas.n^^^^^^^ 
one of ord,nao^ sk.ll «• PP 
nucleic acid cons.mcu »h,ch are ,„ "silen. 
. .orexa,np,e,asdiscussedabove.o.i.,o.^^^^ 

s„bs.i.».ion»" «... sub..,.»..ons .n a nucle,c J ^^^ence 
,3 a..era.ioninane„codedpo,ypep..)are.™^^^^^^^^ 



of the present invention. 

Nucleic Acid Hybridization 

Nucleic acids "hybridize" when they associate, typically in solution. 
Nucleic acids hybridize due to a variety of well characterized physico-chemical forces, 
such as hydrogen bonding, solvent exclusion, base stacking and the like. An extensive 
guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory 
Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic Acid 
Probes, part I, chapter 2, "Overview of principles of hybridization and the strategy of 
nucleic acid probe assays," (Elsevier, New York), as well as in Ausubel, supra. Hames 
and Higgins (1995) Gene Probes 1, IRL Press at Oxford University Press, Oxford, 
England (Hames and Higgins 1) and Hames and Higgins (1995) Gene Probes 2, IRL Press 
at Oxford University Press, Oxford, England (Hames and Higgins 2) provide details on the 
synthesis, labeling, detection and quantification of DNA and RNA, including 
oligonucleotides. 

"Stringent hybridization wash conditions" in the context of nucleic acid 
hybridization experiments, such as Southern and northern hybridizations, are sequence 
dependent, and are different under different environmental parameters. An extensive 
guide to the hybridization of nucleic acids is found in Tijssen (1993), supra, and in Hames 
and Higgins 1 and Hames and Higgins 2, supra. 

For purposes of the present invention, generally, "highly stringent" 
hybridization and wash conditions are selected to be about 5° C or less lower than 
the thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH 
(as noted below, highly stringent conditions can also be referred to in comparative terms). 
The Tm is the temperature (under defined ionic strength and pH) at which 50% of the test 
sequence hybridizes to a perfectly matched probe. Very stringent conditions are selected 
to be equal to the Tm for a particular probe. 

The Tm of a nucleic acid duplex indicates the temperature at which the 
duplex is 50% denatured under the given conditions, and it represents a direct measure of 
the stability of the nucleic acid hybrid. Thus, the Tm corresponds to the temperature at 
the midpoint in the transition from helix to random coil; it depends on length, nucleotide 
composition, and ionic strength for long stretches of nucleotides. 
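
The dependence of Tm on length, nucleotide composition and ionic strength can be illustrated with two widely used empirical approximations. The Python sketch below is offered under stated assumptions rather than as the definitive calculation: tm_long_duplex uses a commonly cited estimate for long DNA-DNA duplexes (81.5 + 16.6 log10 M + 0.41 (%G+C) - 0.72 (%formamide) - 500/n), and tm_short_oligo uses the simple 4(G+C) + 2(A+T) rule for short oligonucleotides. The function and parameter names are hypothetical.

```python
import math


def tm_long_duplex(seq, monovalent_molar, formamide_percent=0.0):
    """Estimate Tm (deg C) of a long DNA-DNA duplex.

    seq: one strand of the duplex (A/C/G/T).
    monovalent_molar: molarity of monovalent cations (e.g., Na+).
    formamide_percent: percentage of formamide in the solution.
    """
    n = len(seq)
    gc_percent = 100.0 * (seq.count("G") + seq.count("C")) / n
    return (81.5
            + 16.6 * math.log10(monovalent_molar)
            + 0.41 * gc_percent
            - 0.72 * formamide_percent
            - 500.0 / n)


def tm_short_oligo(seq):
    """Estimate Tm (deg C) of a short oligonucleotide by the 4(G+C) + 2(A+T) rule."""
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    return 4 * gc + 2 * at
```

Under these approximations, lowering the salt concentration or adding formamide lowers the estimated Tm, which is why wash stringency is adjusted through salt, formamide and temperature.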
After hybridization, unhybridized nucleic acid material can be removed by 
a series of washes, the stringency of which can be adjusted depending upon the desired 
results. Low stringency washing conditions (e.g., using higher salt and lower temperature) 
increase sensitivity, but can produce nonspecific hybridization signals and high 
background signals. Higher stringency conditions (e.g., using lower salt and higher 
temperature that is closer to the hybridization temperature) lower the background signal, 
typically with only the specific signal remaining. [...] 

Tm (°C) = 81.5°C + 16.6 (log10 M) + 0.41 (%G + C) - 0.72 (%f) - 500/n 

[...] See Rapley and Walker, supra. [...] 

where M is the molarity [...] 

Equations 1 and 2 are typically accurate only for hybrid duplexes longer 
than about 100-200 nucleotides. Id. 

The Tm of nucleic acid sequences shorter than [...] nucleotides can be 
calculated as follows: 

[...] 

where A (adenine), C, T (thymine), and G are the numbers of the 
corresponding nucleotides. 

An example of stringent hybridization conditions for hybridization of 
complementary nucleic acids [...] complementary residues on a 
filter in a Southern or northern blot is 50% [...] 






PCT/US00/27781 

WO 01/25438 

An sample of s-nngem wash condUionsis 

„e hybnd,«ion being earned o.. ovemghi. 
.„..SSCwa.a.«=CroHSn.n.«.«-^^^^^ 

background probe signal. An example 

minutes. „io of 2 5x-5x (or higher) than thai observed 

In general, a signal to noise ratio of 2.5X ^ J^^^ 

, homology .o,e.s.,. he nucle,cac,ds of. he prese 

listings herein. . ji,in„s ate selecled lo be aboul 5° C or less 

AS noted, "highly stringent condu.ons 

<T 1 for the specific sequence at a oeiineu 
than the thermal melttng potnt (T„) "" J , ,„ „e nucleotide 

S^.e.g.Rapleya-O**"'™"":- . ^ „^ ,o .denttfy nucleic acids of the 
Comparative hybridtzatton can be used to ^^^^^ 

..entionand.h.scomparativehybridi.a^.-h'^-^^^^^ 

20 disUnguishing nucleic acds of the _ j„,„,i„„ wicates 

between two nucleotide se<,uences ,n the conux ^^^^^^^^ _^ 

„,a,ivel,st,ongs,ructurals,m,laH.y*omology..e.«^^^ 
.^t^ncelistingheretnHighlystringenthy^d^-^^^^ 

^^onstrates a degree of similarity or h"™ « » ^ ^ 
, arrangememororderthattsgreaterth.*.*^^^^^^^^^ 

..mditions. in paniCar. dc,ect,„n o, h.g^y .n ^^^^^ 
^ntinventionindicatesstrongstrnctutalstm.^ y 

Lleo,idestruc.t.re,basecomposit,on,a.angem^^^^^ 

p„,,dedin.heae,.«ncehst,ngsh«^^^^^^^^^ 
nucleic acids which hybndtM to the e«mp 



30 

conditions. 
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^ of ,he ,is.=d nudcc acds ^e ,. nuCeic acid «,..nc=s SEQ m NO. 

, CEO IDNO-72.0 SEQ ID NO:78. and complememarypolynucleol.de 

NO-.JS.andSEQU^^U"'" v „ri„™ni conditions, or uUra- 

conditions). Stringem hybridizafon (mclud.ng. eg, 1..8Wy s J 
stHngency,orultra-„,trahighs,ri,.g=ncyhybHd,.a„oncond,.ons)and 

eaneasilybede—emplricallyfcr-any^nnc^^^^^^^^^ 
For example, in determining highly stringent ny 
, cc.di.ions,tHel,yMdizationand»as^condi,,onsaregrad„a,ly — 

i^^asing temperature, decreasing saU concentrat.on. ---^^ ^ " 

..„Hr.,easing...eoncen..a.io„»';-^^^^^^^^^ 
.,„or»asH>.— 

hybridaation and wash cond,«ons a« gra y ^,0:35, SEQ ID 

NO:72 to SEQ ID J ^ ,„„prtsing one o, more 

a perfectly matched ""P" ""^'^'^o-, to SEQ ID NO:35. SEQ ID NO:72 to 

SBQ.DNO.S.anc^mp,«^^^^^^^ 
20 noise ratio that ,s at least 2.5X. an v ,„rt,i. case the unmatched targelis a 

hybHdi.a.ion of the probe to an unmatch«. target^." - ^^^^ 
„.,eicacidcorrespondingtoaKnownal..amt^™...^ 

. :r:rr:g^:B:Lcessionn.bers:I.^^^^^^^^^^^ 
X02.S.lpha.e>— ^^^^^ 

,ffN.,B). X03U5 (alpha-8) X029 » .Ox-l). 

V0O532 (alpha-C), X02960 ^''^'- I'^'l'^^l, V0^549 (alpha-2a), and 1083.3 
,„....0m7.I07S2..M.23. a^^^^ 
3d (alpha-Cohl). Additional such sequences ,^ 

sVill in an. Nomenclamre of dte human — ^^^^^ 3„ ^,„„ « „, a„6, 
Diaze.al.,(1996)J. We*-»»a«iCotoeR«. 16.179 
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rvPlv each of Which IS incorporated 

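A test nucleic acid is said to specifically hybridize to a probe nucleic acid when it
hybridizes at least ½ as well to the probe as to the perfectly matched complementary
target, i.e., with a signal to noise ratio at least ½ as high as hybridization of the
probe to the target under conditions in which the perfectly matched probe binds to the
perfectly matched complementary target with the selected signal to noise ratio relative to
that observed for hybridization to any of the unmatched target nucleic acids represented
by GenBank accession numbers J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956
(Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957
(alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7),
X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787, I07821, M12350 (alpha-F),
M38289, V00549 (alpha-2a), and I08313 (alpha-Con1).

Ultra high-stringency hybridization and wash conditions are those in which the stringency
of hybridization and wash conditions is increased until the signal to noise ratio for
binding of the probe to the perfectly matched complementary target nucleic acid is at
least 10X as high as that observed for hybridization to any of the unmatched target
nucleic acids represented by GenBank accession numbers J00210 (alpha-D), J00207 (Alpha-A),
X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B),
X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532
(alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787,
I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1). A target
nucleic acid which hybridizes to a probe under such conditions, with a signal to noise
ratio of at least ½ that of the perfectly matched complementary target nucleic acid, is
said to bind to the probe under ultra-high stringency conditions.

Similarly, even higher levels of stringency can be determined by gradually increasing the
hybridization and/or wash conditions of the relevant hybridization assay. For example,
conditions in which the signal to noise ratio for binding of the probe to the perfectly
matched complementary target nucleic acid is at least 10X, 20X, 50X, 100X, or 500X or more
as high as that observed for hybridization to any of the unmatched target nucleic acids
represented by GenBank accession numbers J00210 (alpha-D), J00207 (Alpha-A), X02958
(Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125
(alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C),
X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787, I07821,
M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1), or other similar
interferon-alpha sequences presented in GenBank, can be identified. A target nucleic acid
which hybridizes to a probe under such conditions, with a signal to noise ratio of at
least ½ that of the perfectly matched complementary target nucleic acid, is said to bind
to the probe under ultra-ultra-high stringency conditions.

Target nucleic acids which hybridize to the nucleic acids represented by SEQ ID NO:1 to
SEQ ID NO:35 and SEQ ID NO:72 to SEQ ID NO:78 under high, ultra-high and ultra-ultra high
stringency conditions are a feature of the invention. Examples of such nucleic acids
include those with one or a few silent or conservative nucleic acid substitutions as
compared to a given nucleic acid sequence.

Nucleic acids which do not hybridize to each other under stringent conditions are still
substantially identical if the polypeptides which they encode are substantially identical.
This occurs, e.g., when a copy of a nucleic acid is created using the maximum codon
degeneracy permitted by the genetic code, or when antisera or antiserum generated against
one or more of SEQ ID NO:36 to SEQ ID NO:70 and SEQ ID NO:79 to SEQ ID NO:85, which has
been subtracted using the polypeptides encoded by known interferon-alpha sequences, is
used; such known sequences include, e.g., those encoded by the following interferon-alpha
nucleic acid sequences in GenBank: J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6),
X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8),
X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960
(alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787, I07821, M12350
(alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1), or other similar
interferon-alpha sequences presented in GenBank. Further details on immunological
identification of polypeptides of the invention are found below. Additionally, for
distinguishing between duplexes with sequences of less than about 100 nucleotides, a TMACl
hybridization procedure known to those of ordinary skill in the art can be used. See,
e.g., Sorg, U. et al., Nucleic Acids Res. (Sept. 11, 1991) 19(17), incorporated herein by
reference in its entirety for all purposes.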

In one aspect, the invention provides a nucleic acid which comprises a unique subsequence
in a nucleic acid selected from SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID
NO:78. The unique subsequence is unique as compared to a nucleic acid corresponding to any
known interferon-alpha nucleic acid sequence including, e.g., the known sequences
represented by GenBank accession numbers J00210 (alpha-D), J00207 (Alpha-A), X02958
(Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125
(alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C),
X02960 (alpha-7), X02961 (alpha-10 pseudogene), A12109, R0067 (Gx-1), I01614, I01787,
I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1), or other
similar interferon-alpha sequences presented in GenBank. Such unique subsequences can be
determined by aligning any of SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78
against the complete set of nucleic acids corresponding to known interferon-alpha nucleic
acid sequences, such as, e.g., J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6),
X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8),
X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960
(alpha-7), X02961 (alpha-10 pseudogene), A12109 (alpha-4B), R0067 (Gx-1), I01614, I01787,
I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1), or other
similar interferon-alpha sequences presented in GenBank. Alignment can be performed using
the BLAST algorithm set to default parameters. Any unique subsequence is useful, e.g., as
a probe to identify the nucleic acids of the invention.
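
As a rough illustration of the idea of a unique subsequence, the following Python sketch
lists the windows of a query sequence that do not occur exactly in any sequence of a
reference set. It is a simplification offered under stated assumptions: the text above
contemplates BLAST alignment with default parameters, whereas this sketch substitutes
plain exact k-mer matching, and the function name, window length and toy sequences are
hypothetical.

```python
def unique_subsequences(query, known, k):
    """Return (offset, k-mer) pairs from `query` that are absent from every
    sequence in `known`. Exact matching only; this is a stand-in for, not an
    implementation of, alignment-based comparison."""
    known_kmers = set()
    for seq in known:
        s = seq.upper()
        for i in range(len(s) - k + 1):
            known_kmers.add(s[i:i + k])
    q = query.upper()
    return [(i, q[i:i + k])
            for i in range(len(q) - k + 1)
            if q[i:i + k] not in known_kmers]


if __name__ == "__main__":
    # Toy sequences for illustration only; not sequences of the invention.
    for offset, kmer in unique_subsequences("ATGGCC", ["ATGGAA"], 4):
        print(offset, kmer)
```

In practice the window length would be chosen to suit the intended probe, and the
reference set would hold the known interferon-alpha sequences.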

Similarly, the invention includes a polypeptide which comprises a unique amino acid
subsequence in a polypeptide selected from: SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79
to SEQ ID NO:85. Here, the unique subsequence is unique as compared to an amino acid
subsequence of a known interferon-alpha polypeptide including, e.g., an amino acid
subsequence of a polypeptide encoded by a known interferon-alpha nucleic acid
corresponding to any of GenBank accession numbers: J00210 (alpha-D), J00207 (Alpha-A),
X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B),
X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532
(alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787,
I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1), or other
similar interferon-alpha nucleic acid or polypeptide sequences presented in GenBank. Here
again, the polypeptide is aligned against the complete set of known interferon-alpha
polypeptides, such as those polypeptides encoded by nucleic acids corresponding to GenBank
accession numbers J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5),
V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-1B), X03125 (alpha-8), X02957 (alpha-16),
V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10
pseudogene), R0067 (Gx-1), I01614, I01787, I07821, M12350 (alpha-F), M38289, V00549
(alpha-2a), and I08313 (alpha-Con1) (note that where the sequence corresponds to a
non-translated sequence such as a pseudogene, the corresponding polypeptide is generated
by in silico translation of the nucleic acid sequence, in the reading frame of the
homologous alpha interferon nucleic acids).

--"-r:.,o„,.p«^.— ^^^^^^^ 

under a. teas, s,Hn,en. o, ^^^^^^^^^ . „„„ue su.se,oence ,n a 
.0 I^.-).oa„n^oecod.n80,Uo"^^^^^^^^^^ 

po,ypep.id= Pieced from: SEQ ID N03 ^ ^ ^„ ^ 

; m85, wf,=r.. .e u.,.e .Hown . OenBan. or .o a 

— ofa.nown.— ^^^^^ Un..c s..nc. are 

polypeptide corresponding to any ot 

oUgpnucfeoUde «Uh a. ,eas. abo.. „„„,,^,^ ,„ a conuo, r.-c.e.c acK> 
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the unique coding oligonucleotide with at least a 2x higher signal to noise ratio as 
compared to hybridization of the control nucleic acid to the coding oligonucleotide. 
Again, higher signal to noise ratios can be selected, e.g., about 2.5x, about 5x, about 10x,
about 20x, about 30x, about 50x or more. The particular signal will depend on the label
used in the relevant assay, e.g., a fluorescent label, a colorimetric label, a radioactive
label, or the like.

In another aspect, the invention provides a polypeptide that comprises a
unique subsequence in a polypeptide selected from SEQ ID NO:36 to SEQ ID NO:70 and
SEQ ID NO:79 to SEQ ID NO:85, wherein the unique subsequence is unique as compared
to a polypeptide sequence corresponding to a known interferon-alpha polypeptide, such as,
e.g., an interferon-alpha polypeptide sequence present in GenBank.

SUBSTRATES AND FORMATS FOR SEQUENCE RECOMBINATION 

The polynucleotides of the invention are useful as substrates for a variety of
recombination and recursive recombination (e.g., DNA shuffling) reactions, as well as
other diversity generating techniques, including mutagenesis techniques and standard
cloning methods as set forth in, e.g., Ausubel, Berger and Sambrook, supra, i.e., to
produce additional interferon-alpha homologues with desired properties. Based on the
screening or selection protocols employed, recombinant, e.g., shuffled, interferon-alpha
homologue polypeptides can be generated and isolated that confer a variety of desirable
characteristics, e.g., enhanced antiviral activity, enhanced antiproliferative activity,
increased growth inhibitory, cytostatic and/or cytotoxic activities towards particular target
cells, reduced immunogenicity, etc.

A variety of diversity generating protocols, including nucleic acid shuffling
protocols, are available and fully described in the art. The procedures can be used
separately, and/or in combination to produce one or more variants of a nucleic acid or set
of nucleic acids, as well as variants of encoded proteins. Individually and collectively, these
procedures provide robust, widely applicable ways of generating diversified nucleic acids
and sets of nucleic acids (including, e.g., nucleic acid libraries) useful, e.g., for the
engineering or rapid evolution of nucleic acids, proteins, pathways, cells and/or organisms
with new and/or improved characteristics.

While distinctions and classifications are made in the course of the ensuing 

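discussion for clarity, it will be appreciated that the techniques are often not mutually
exclusive.

The result of any of the diversity generating procedures described herein can be the
generation of one or more nucleic acids, which can be selected or screened for nucleic
acids that encode proteins with or which confer desirable properties. Following
diversification, any nucleic acids that are produced can be selected for a desired
activity or property, e.g., enhanced antiviral activity, enhanced antiproliferative
activity, antiangiogenic activity, increased growth inhibitory, cytostatic and/or
cytotoxic activities towards particular target cells, reduced immunogenicity, etc. This
can include identifying nucleic acids having enhanced antiviral, antiproliferative, growth
inhibitory, cytostatic, and/or cytotoxic activity, or reduced immunogenicity, and can
include identifying any activity that can be detected, at the discretion of the
practitioner.

Descriptions of a variety of diversity generating procedures applicable to the present
invention include the following publications and the references cited therein: Soong et
al. (2000) "Molecular Breeding of Viruses" Nat. Genet. 25(4):436-439; Stemmer et al. (1999)
"Molecular breeding of viruses for targeting and other clinical properties" Tumor
Targeting 4:1-4; Ness et al. (1999) "DNA Shuffling of subgenomic sequences of subtilisin"
Nature Biotechnology 17:893-896; Chang et al. (1999) "Evolution of a cytokine using DNA
family shuffling" Nature Biotechnology 17:793-797; Minshull and Stemmer (1999) "Protein
evolution by molecular breeding" Current Opinion in Chemical Biology 3:284-290; Christians
et al. (1999) "Directed evolution of thymidine kinase for AZT phosphorylation using DNA
family shuffling" Nature Biotechnology 17:259-264; Crameri et al. (1998) "DNA shuffling of
a family of genes from diverse species accelerates directed evolution" Nature 391:288-291;
Crameri et al. (1997) "Molecular evolution of an arsenate detoxification pathway by DNA
shuffling," Nature Biotechnology 15:436-438; Zhang et al. (1997) "Directed evolution of an
effective fucosidase from a galactosidase by DNA shuffling and screening" Proc. Natl. Acad.
Sci. USA 94:4504-4509; Patten et al. (1997) "Applications of DNA Shuffling to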
Pharmaceuticals and Vaccines," Current Opinion in Biotechnology 8:724-733; Crameri et
al. (1996) "Construction and evolution of antibody-phage libraries by DNA shuffling,"
Nature Medicine 2:100-103; Crameri et al. (1996) "Improved green fluorescent protein by
molecular evolution using DNA shuffling," Nature Biotechnology 14:315-319; Gates et al.
(1996) "Affinity selective isolation of ligands from peptide libraries through display on a
lac repressor 'headpiece dimer,'" J. Mol. Biol. 255:373-386; Stemmer (1996) "Sexual PCR
and Assembly PCR" In: The Encyclopedia of Molecular Biology. VCH Publishers, New
York. pp. 447-457; Crameri and Stemmer (1995) "Combinatorial multiple cassette
mutagenesis creates all the permutations of mutant and wildtype cassettes," BioTechniques
18:194-195; Stemmer et al. (1995) "Single-step assembly of a gene and entire plasmid
from large numbers of oligodeoxy-ribonucleotides" Gene 164:49-53; Stemmer (1995)
"The Evolution of Molecular Computation," Science 270:1510; Stemmer (1995)
"Searching Sequence Space," Bio/Technology 13:549-553; Stemmer (1994) "Rapid
evolution of a protein in vitro by DNA shuffling," Nature 370:389-391; and Stemmer
(1994) "DNA shuffling by random fragmentation and reassembly: In vitro recombination
for molecular evolution," Proc. Nat'l Acad. Sci. USA 91:10747-10751.

Additional details regarding DNA shuffling and other diversity generating 

methods can be found in the following U.S. patents, PCT publications, and EP
publications: USPN 5,605,793 to Stemmer (February 25, 1997), "Methods for In vitro
Recombination;" USPN 5,811,238 to Stemmer et al. (September 22, 1998) "Methods for
Generating Polynucleotides having Desired Characteristics by Iterative Selection and
Recombination;" USPN 5,830,721 to Stemmer et al. (November 3, 1998), "DNA
Mutagenesis by Random Fragmentation and Reassembly;" USPN 5,834,252 to Stemmer
(November 10, 1998) "End-Complementary Polymerase Reaction;" USPN 5,837,458 to
Minshull (November 17, 1998), "Methods and Compositions for Cellular and Metabolic
Engineering;" WO 95/22625, Stemmer and Crameri, "Mutagenesis by Random
Fragmentation and Reassembly;" WO 96/33207 by Stemmer and Lipschutz, "End
Complementary Polymerase Chain Reaction;" WO 97/20078 by Stemmer and Crameri
"Methods for Generating Polynucleotides having Desired Characteristics by Iterative
Selection and Recombination;" WO 97/35966 by Minshull and Stemmer, "Methods and
Compositions for Cellular and Metabolic Engineering;" WO 99/41402 by Punnonen et al.
"Targeting of Genetic Vaccine Vectors;" WO 99/41383 by Punnonen et al. "Antigen
Library Immunization;" WO 99/41369 by Punnonen et al. "Genetic Vaccine Vector
Engineering;" WO 99/41368 by Punnonen et al. "Optimization of Immunomodulatory
Properties of Genetic Vaccines;" EP 752008 by Stemmer and Crameri, "DNA
Mutagenesis by Random Fragmentation and Reassembly;" EP 0932670 by Stemmer
"Evolving Cellular DNA Uptake by Recursive Sequence Recombination;" WO 99/23107
by Stemmer et al., "Modification of Virus Tropism and Host Range by Viral Genome
Shuffling;" WO 99/21979 by Apt et al., "Human Papillomavirus Vectors;" WO 98/31837
by Del Cardayre et al., "Evolution of Whole Cells and Organisms by Recursive Sequence
Recombination;" WO 98/27230 by Patten and Stemmer, "Methods and Compositions for
Polypeptide Engineering;" EP 0946755 by Patten and Stemmer, "Methods and
Compositions for Polypeptide Engineering;" WO 98/13487 by Stemmer et al.,
"Methods for Optimization of Gene Therapy by Recursive Sequence Shuffling and
Selection;" WO 00/00632, "Methods for Generating Highly Diverse Libraries;" WO
00/09679, "Methods for Obtaining in vitro Recombined Polynucleotide Sequence Banks
and Resulting Sequences;" WO 98/42832 by Arnold et al., "Recombination of
Polynucleotide Sequences Using Random or Defined Primers;" WO 99/29902 by Arnold
et al., "Method for Creating Polynucleotide and Polypeptide Sequences;" WO 98/41653
by Vind, "An in vitro Method for Construction of a DNA Library;" WO 98/41622 by
Borchert et al., "Method for Constructing a Library Using DNA Shuffling;" and WO
98/42727 by Pati and Zarling, "Sequence Alterations using Homologous Recombination."

Certain U.S. applications provide additional details regarding DNA 
shuffling and related techniques, as well as other diversity generating methods, including 

"SHUFFLING OF CODON ALTERED GENES" by Patten et al. filed September 29,
1998 (USSN 60/102,362), January 29, 1999 (USSN 60/117,729), and September 28, 1999
(USSN 09/407,800); "EVOLUTION OF WHOLE CELLS AND ORGANISMS BY
RECURSIVE SEQUENCE RECOMBINATION", by Del Cardayre et al. filed July 15,
1998 (USSN 09/166,188), and July 15, 1999 (USSN 09/354,922);
"OLIGONUCLEOTIDE MEDIATED NUCLEIC ACID RECOMBINATION" by
Crameri et al. filed February 5, 1999 (USSN 60/118,813), June 24, 1999 (USSN
60/141,049), and September 28, 1999 (USSN 09/408,392); "USE OF CODON-BASED
OLIGONUCLEOTIDE SYNTHESIS FOR SYNTHETIC SHUFFLING" by Welch et al.,
filed September 28, 1999 (USSN 09/408,393); "METHODS FOR MAKING
CHARACTER STRINGS, POLYNUCLEOTIDES & POLYPEPTIDES HAVING
DESIRED CHARACTERISTICS" by Selifonov and Stemmer, filed February 5, 1999
(USSN 60/118,854) and October 12, 1999 (USSN 09/416,375); "RECOMBINATION OF
INSERTION MODIFIED NUCLEIC ACIDS" by Patten et al., filed March 5, 1999 (USSN
60/122,943), July 2, 1999 (USSN 60/142,299), November 10, 1999 (USSN 60/164,618),
and November 10, 1999 (USSN 60/164,617); and "SINGLE-STRANDED NUCLEIC
ACID TEMPLATE-MEDIATED RECOMBINATION AND NUCLEIC ACID
FRAGMENT ISOLATION" by Affholter, USSN 60/186,482 filed March 2, 2000.

As a review of the foregoing publications, patents, published foreign
applications and U.S. patent applications reveals, diversity generation methods, such as
shuffling (or "recursive recombination") of nucleic acids, to provide new nucleic acids
with desired properties can be carried out by a number of established methods. Any of
these methods can be adapted to the present invention to evolve the alpha interferons
discussed herein to produce new alpha interferon homologues with new or improved
properties. Both the methods of making such interferons and the interferons (e.g., IFN
homologues) produced by these methods are a feature of the invention.

In brief, several different general classes of sequence modification methods, such as
recombination, are applicable to the present invention and set forth, e.g., in the
references above. First, nucleic acids can be recombined in vitro by any of a variety of
techniques discussed in the references above, including e.g., DNAse digestion of nucleic
acids to be recombined followed by ligation and/or PCR reassembly of the nucleic acids.
Second, nucleic acids can be recursively recombined in vivo or ex vivo, e.g., by allowing
recombination to occur between nucleic acids in cells. Third, whole genome recombination
methods can be used in which whole genomes of cells or other organisms are recombined,
optionally including spiking of the genomic recombination mixtures with desired library
components (e.g., genes corresponding to the pathways of the present invention). Fourth,
synthetic recombination methods can be used, in which oligonucleotides corresponding to
targets of interest are synthesized and reassembled in PCR or ligation reactions which
include oligonucleotides which correspond to more than one parental nucleic acid, thereby
generating new recombined nucleic acids. Oligonucleotides can be made by standard

nucleotide addition methods, or can be made, e.g., by tri-nucleotide synthetic approaches.
Fifth, in silico methods of recombination can be effected in which genetic algorithms are
used in a computer to recombine sequence strings which correspond to homologous (or even
non-homologous) nucleic acids. The resulting recombined sequence strings are optionally
converted into nucleic acids by synthesis of nucleic acids which correspond to the
recombined sequences, e.g., in concert with oligonucleotide synthesis and gene reassembly
techniques. Any of the preceding general recombination formats can be practiced in a
reiterative fashion to generate a more diverse set of recombinant nucleic acids. Sixth,
methods of accessing natural diversity can be used, e.g., by hybridization of diverse
nucleic acids or nucleic acid fragments to templates, followed by polymerization and/or
ligation to regenerate full-length sequences, optionally followed by degradation of the
templates.

The above references provide these and other basic recombination formats as well as many
modifications of these formats. Regardless of the format which is used, the nucleic acids
of the invention can be recombined (with each other or with related (or even unrelated)
nucleic acids) to produce a diverse set of recombinant nucleic acids, including, e.g.,
homologous nucleic acids. In general, the sequence recombination techniques described
herein provide for recombination between the nucleic acids of the sequence listing herein,
providing a way of exploring the manner in which different combinations of sequences can
affect a desired result.

Following recombination, any nucleic acids which are produced can be selected for a
desired activity. In the context of the present invention, this can include testing for
and identifying any activity that can be detected by any assay in the art. A variety of
alpha interferon related properties can be assayed for, using any available means.

DNA mutagenesis and shuffling provide a robust, widely applicable means of generating
diversity useful for the engineering of proteins, pathways, cells and organisms with
improved characteristics. In addition to the basic formats described above,
it is sometimes desirable to combine shuffling methodologies with other techniques for
generating diversity. In conjunction with (or separately from) shuffling methods, a variety
of diversity generation methods can be practiced and the results (i.e., diverse populations
of nucleic acids) screened for in the systems of the invention. Additional diversity can be
introduced by methods which result in the alteration of individual nucleotides or groups of
contiguous or non-contiguous nucleotides, i.e., mutagenesis methods. Many mutagenesis
methods are found in the above-cited references; additional details regarding mutagenesis
methods can be found in the references listed below.
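
The in silico recombination format noted above, in which sequence strings are recombined
in a computer, can be sketched as follows. This Python example is illustrative only and is
not drawn from the cited references: it assumes the parental strings have already been
aligned to a common length, picks crossover points at random, and switches to a different
parent at each crossover. The function name and parameters are hypothetical.

```python
import random


def recombine(parents, n_crossovers, rng):
    """Build one chimeric string from equal-length parental strings.

    Crossover positions are sampled without replacement; at each position the
    source parent is switched to a different, randomly chosen parent.
    """
    if len(parents) < 2:
        raise ValueError("at least two parental strings are required")
    length = len(parents[0])
    if any(len(p) != length for p in parents):
        raise ValueError("parental strings must be pre-aligned to equal length")
    if not 0 <= n_crossovers <= length - 1:
        raise ValueError("too many crossovers for this sequence length")

    points = sorted(rng.sample(range(1, length), n_crossovers))
    current = rng.randrange(len(parents))
    child, start = [], 0
    for point in points + [length]:
        child.append(parents[current][start:point])
        start = point
        # Switch to any parent other than the one just used.
        current = rng.choice([i for i in range(len(parents)) if i != current])
    return "".join(child)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded for repeatability
    print(recombine(["AAAAAAAA", "CCCCCCCC"], 2, rng))
```

Repeated calls yield a library of recombined strings, which could then be converted into
nucleic acids by oligonucleotide synthesis and reassembly as described above.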

Mutagenesis methods of generating diversity include, for example, 

recombination (PCT/US98/05223; Publ. No. WO98/42727); site-directed mutagenesis
(Ling et al. (1997) "Approaches to DNA mutagenesis: an overview," Anal. Biochem.
254(2):157-178; Dale et al. (1996) "Oligonucleotide-directed random mutagenesis using
the phosphorothioate method," Methods Mol. Biol. 57:369-374; Smith (1985) "In vitro
mutagenesis," Ann. Rev. Genet. 19:423-462; Botstein & Shortle (1985) "Strategies and
applications of in vitro mutagenesis," Science 229:1193-1201; Carter (1986)
"Site-directed mutagenesis," Biochem. J. 237:1-7; and Kunkel (1987) "The efficiency of
oligonucleotide directed mutagenesis," in Nucleic Acids & Molecular Biology (Eckstein,
F. and Lilley, D.M.J. eds., Springer Verlag, Berlin)); mutagenesis using uracil containing
templates (Kunkel (1985) "Rapid and efficient site-specific mutagenesis without
phenotypic selection," Proc. Nat'l Acad. Sci. USA 82:488-492; Kunkel et al. (1987)
"Rapid and efficient site-specific mutagenesis without phenotypic selection," Methods
in Enzymol. 154:367-382; and Bass et al. (1988) "Mutant Trp repressors with new
DNA-binding specificities," Science 242:240-245); oligonucleotide-directed mutagenesis
(Methods in Enzymol. 100:468-500 (1983); Methods in Enzymol. 154:329-350
(1987); Zoller & Smith (1982) "Oligonucleotide-directed mutagenesis using M13-derived
vectors: an efficient and general procedure for the production of point mutations in any
DNA fragment," Nucleic Acids Res. 10:6487-6500; Zoller & Smith (1983)
"Oligonucleotide-directed mutagenesis of DNA fragments cloned into M13 vectors,"
Methods in Enzymol. 100:468-500; and Zoller & Smith (1987) "Oligonucleotide-directed
mutagenesis: a simple method using two oligonucleotide primers and a single-stranded
DNA template," Methods in Enzymol. 154:329-350); phosphorothioate-modified
DNA mutagenesis (Taylor et al. (1985) "The use of phosphorothioate-modified
DNA in restriction enzyme reactions to prepare nicked DNA," Nucl. Acids Res. 13:8749-8764;
Taylor et al. (1985) "The rapid generation of oligonucleotide-directed mutations at
high frequency using phosphorothioate-modified DNA," Nucl. Acids Res. 13:8765-8787;
Nakamaye & Eckstein (1986) "Inhibition of restriction endonuclease Nci I
cleavage by phosphorothioate groups and its application to oligonucleotide-directed
mutagenesis," Nucl. Acids Res. 14:9679-9698; Sayers et al. (1988) "5'-3' Exonucleases in
phosphorothioate-based oligonucleotide-directed mutagenesis," Nucl. Acids Res. 16:791-802;
and Sayers et al. (1988) "Strand specific cleavage of phosphorothioate-containing
DNA by reaction with restriction endonucleases in the presence of ethidium bromide,"
Nucl. Acids Res. 16:803-814); mutagenesis using gapped duplex DNA (Kramer et al.
(1984) "The gapped duplex DNA approach to oligonucleotide-directed mutation
construction," Nucl. Acids Res. 12:9441-9456; Kramer & Fritz (1987)
"Oligonucleotide-directed construction of mutations via gapped duplex DNA," Methods in
Enzymol. 154:350-367; Kramer et al. (1988) "Improved enzymatic in vitro reactions in the
gapped duplex DNA approach to oligonucleotide-directed construction of mutations," Nucl.
Acids Res. 16:7207; and Fritz et al. (1988) "Oligonucleotide-directed construction of
mutations: a gapped duplex DNA procedure without enzymatic reactions in vitro," Nucl.
Acids Res. 16:6987-6999).

Additional suitable methods include point mismatch repair (Kramer et al. 

(1984) "Point Mismatch Repair," Cell 38:879-887), mutagenesis using repair-deficient
host strains (Carter et al. (1985) "Improved oligonucleotide site-directed mutagenesis
using M13 vectors," Nucl. Acids Res. 13:4431-4443; and Carter (1987) "Improved
oligonucleotide-directed mutagenesis using M13 vectors," Methods in Enzymol.
154:382-403), deletion mutagenesis (Eghtedarzadeh & Henikoff (1986) "Use of
oligonucleotides to generate large deletions," Nucl. Acids Res. 14:5115),
restriction-selection and restriction-purification (Wells et al. (1986)
"Importance of hydrogen-bond formation in stabilizing the transition state of subtilisin,"
Phil. Trans. R. Soc. Lond. A 317:415-423), mutagenesis by total gene synthesis (Nambiar
et al. (1984) "Total synthesis and cloning of a gene coding for the ribonuclease S protein,"
Science 223:1299-1301; Sakmar and Khorana (1988) "Total synthesis and expression of
a gene for the α-subunit of bovine rod outer segment guanine nucleotide-binding protein
(transducin)," Nucl. Acids Res. 14:6361-6372; Wells et al. (1985) "Cassette mutagenesis:
an efficient method for generation of multiple mutations at defined sites," Gene 34:315-323;
and Grundstrom et al. (1985) "Oligonucleotide-directed mutagenesis by microscale
'shot-gun' gene synthesis," Nucl. Acids Res. 13:3305-3316), double-strand break repair
(Mandecki (1986) "Oligonucleotide-directed double-strand break repair in plasmids of
Escherichia coli: a method for site-specific mutagenesis," Proc. Nat'l Acad. Sci. USA
83:7177-7181). Additional details on many of the above methods can be found in
Methods in Enzymology, Vol. 154, which also describes useful controls for
trouble-shooting problems with various mutagenesis methods.

Random or semi-random mutagenesis using doped or degenerate 

oligonucleotides (Arkin and Youvan (1992) "Optimizing nucleotide mixtures to encode
specific subsets of amino acids for semi-random mutagenesis," Biotechnology 10:297-300;
Reidhaar-Olson et al. (1991) "Random mutagenesis of protein sequences using
oligonucleotide cassettes," Methods Enzymol. 208:564-86; Lim and Sauer (1991) "The
role of internal packing interactions in determining the structure and stability of a protein,"
J. Mol. Biol. 219:359-76; Breyer and Sauer (1989) "Mutational analysis of the fine
specificity of binding of monoclonal antibody 51F to lambda repressor," J. Biol. Chem.
264:13355-60); "Walk-Through Mutagenesis" (Crea, R.; US Patents 5,830,650 and
5,798,208, and EP Patent 0527809 B1) may also be employed to generate diversity.

In one aspect of the present invention, error-prone PCR can be used to
generate nucleic acid variants. Using this technique, PCR is performed under conditions
where the copying fidelity of the DNA polymerase is low, such that a high rate of point
mutations is obtained along the entire length of the PCR product. Examples of such
techniques are found in the references above and, e.g., in Leung et al. (1989) Technique
1:11-15 and Caldwell et al. (1992) PCR Methods Applic. 2:28-33. Similarly, assembly
PCR can be used, in a process which involves the assembly of a PCR product from a
mixture of small DNA fragments. A large number of different PCR reactions can occur in
parallel in the same vial, with the products of one reaction priming the products of another
reaction. Sexual PCR mutagenesis can be used in which homologous recombination
occurs between DNA molecules of different but related DNA sequence in vitro, by
random fragmentation of the DNA molecule based on sequence homology, followed by
fixation of the crossover by primer extension in a PCR reaction. This process is described
in the references above, e.g., in Stemmer (1994) Proc. Nat'l Acad. Sci. USA 91:10747-10751.
Recursive ensemble mutagenesis can be used in which an algorithm for protein
mutagenesis is used to produce diverse populations of phenotypically related mutants
whose members differ in amino acid sequence. This method uses a feedback mechanism
to control successive rounds of combinatorial cassette mutagenesis. Examples of this
approach are found in Arkin & Youvan (1992) Proc. Nat'l Acad. Sci. USA 89:7811-7815.
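
To make the effect of low copying fidelity concrete, the following Python sketch simulates
point mutations accumulating along a template over repeated copying cycles. It is a toy
model under stated assumptions, not a description of any cited protocol: a single lineage
is followed, every position has the same hypothetical per-cycle error rate, only
substitutions are modelled, and the function names are illustrative.

```python
import random

BASES = "ACGT"


def error_prone_copy(template, error_rate, rng):
    """Copy `template`, substituting each base with probability `error_rate`
    by one of the three other bases (substitutions only, no indels)."""
    copied = []
    for base in template:
        if rng.random() < error_rate:
            copied.append(rng.choice([b for b in BASES if b != base]))
        else:
            copied.append(base)
    return "".join(copied)


def error_prone_pcr(template, cycles, error_rate, rng):
    """Follow one lineage through `cycles` successive low-fidelity copies."""
    product = template
    for _ in range(cycles):
        product = error_prone_copy(product, error_rate, rng)
    return product


if __name__ == "__main__":
    rng = random.Random(42)  # seeded for repeatability
    parent = "ATGGCCTTGACCTTTGCT"  # toy template, for illustration only
    variant = error_prone_pcr(parent, cycles=20, error_rate=0.005, rng=rng)
    differences = sum(a != b for a, b in zip(parent, variant))
    print(variant, differences)
```

Running the simulation many times with different seeds gives a population of variants whose
mutations are spread along the full length of the product, as the text describes.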

As noted, oligonucleotide directed mutagenesis can be used in a process
which allows for the generation of site-specific mutations in any nucleic acid sequence of
interest. Examples of such techniques are found in the references above and, e.g., in
Reidhaar-Olson et al. (1988) Science 241:53-57. Similarly, cassette mutagenesis can be
used in a process which replaces a small region of a double stranded DNA molecule with a
synthetic oligonucleotide cassette that differs from the native sequence. The
oligonucleotide can contain, e.g., completely and/or partially randomized native
sequence(s).
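
The replacement of a small region with a partially randomized cassette can be pictured
with the following Python sketch, which enumerates every sequence obtained when a region is
swapped for a cassette written in standard IUPAC degenerate nucleotide codes. The function
names, the example sequence and the choice of an NNK cassette are assumptions made for
illustration; they are not taken from the references above.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand(cassette):
    """List every concrete sequence encoded by a degenerate cassette."""
    return ["".join(p) for p in product(*(IUPAC[c] for c in cassette.upper()))]


def cassette_variants(sequence, start, cassette):
    """Replace len(cassette) bases of `sequence`, beginning at 0-based
    `start`, with each expansion of the degenerate cassette."""
    end = start + len(cassette)
    if start < 0 or end > len(sequence):
        raise ValueError("cassette does not fit within the sequence")
    return [sequence[:start] + v + sequence[end:] for v in expand(cassette)]


if __name__ == "__main__":
    # Randomize one codon (positions 3-5) of a toy sequence with NNK.
    library = cassette_variants("AAACCCGGG", 3, "NNK")
    print(len(library), library[0])
```

An NNK codon expands to 4 x 4 x 2 = 32 sequences; longer or less constrained cassettes grow
multiplicatively, which is why the number of randomized positions is usually kept small.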

In vivo (or ex vivo) mutagenesis can be used in a process of generating
random mutations in any cloned DNA of interest which involves the propagation of the
DNA, e.g., in a strain of E. coli that carries mutations in one or more of the DNA repair
pathways. These "mutator" strains have a higher random mutation rate than that of a
wild-type parent. Propagating the DNA in one of these strains will eventually generate random
mutations within the DNA.

Exponential ensemble mutagenesis can be used for generating
combinatorial libraries with a high percentage of unique and functional mutants, where
small groups of residues are randomized in parallel to identify, at each altered position,
amino acids which lead to functional proteins. Examples of such procedures are found in
Delegrave & Youvan (1993) Biotechnology Research 11:1548-1552. Similarly, random
and site-directed mutagenesis can be used. Examples of such procedures are found in
Arnold (1993) Current Opinion in Biotechnology 4:450-455.

Kits for mutagenesis, library construction, and other diversity generation
methods are also commercially available. For example, kits are available from, e.g.,
Stratagene (e.g., QuickChange™ site-directed mutagenesis kit; and Chameleon™
double-stranded, site-directed mutagenesis kit), Bio/Can Scientific, Bio-Rad (e.g., using the
Kunkel method described above), Boehringer Mannheim Corp., Clonetech Laboratories,
DNA Technologies, Epicentre Technologies (e.g., 5 prime 3 prime kit), Genpak Inc.,
Lemargo Inc., Life Technologies (Gibco BRL), New England Biolabs, Pharmacia Biotech,
Promega Corp., Quantum Biotechnologies, Amersham International plc (e.g., using the
Eckstein method above), and Anglian Biotechnology Ltd (e.g., using the Carter/Winter
method above).

Any of the described shuffling or mutagenesis techniques can be used in
conjunction with procedures which introduce additional diversity into a genome, e.g., a
bacterial, fungal, animal or plant genome. For example, in addition to the methods above,
techniques have been proposed which produce chimeric nucleic acid multimers suitable
for transformation into a variety of species (see, e.g., Schellenberger U.S. Patent No.
5,756,316 and the references above). When such chimeric multimers, consisting of genes that
are divergent with respect to one another (e.g., derived from natural diversity or through
application of site directed mutagenesis, error prone PCR, passage through mutagenic
bacterial strains, and the like), are transformed into a suitable host, this provides a source
of nucleic acid diversity for DNA diversification.

Chimeric multimers transformed into host species are suitable as substrates
for in vivo (or ex vivo) shuffling protocols. Alternatively, a multiplicity of polynucleotides
sharing regions of partial sequence similarity or homology can be transformed into a host
species and recombined in vivo (or ex vivo) by the host cell. Subsequent rounds of cell
division can be used to generate libraries, members of which comprise a single,
homogenous population of monomeric or pooled nucleic acid. Alternatively, the
monomeric nucleic acid can be recovered by standard techniques and recursively
recombined in any of the described shuffling formats.

Chain termination methods of diversity generation have also been proposed
(see, e.g., U.S. Patent No. 5,965,408 and the references above). In this approach, double
stranded DNAs corresponding to one or more genes sharing regions of sequence similarity
or homology are combined and denatured, in the presence or absence of primers specific for
the gene. The single stranded polynucleotides are then annealed and incubated in the
presence of a polymerase and a chain terminating reagent (e.g., ultraviolet, gamma or
X-ray irradiation; ethidium bromide or other intercalators; DNA binding proteins, such as
single strand binding proteins, transcription activating factors, or histones; polycyclic
aromatic hydrocarbons; trivalent chromium or a trivalent chromium salt; or abbreviated
polymerization mediated by rapid thermocycling; and the like), resulting in the production
of partial duplex molecules. The partial duplex molecules, e.g., containing partially
extended chains, are then denatured and reannealed in subsequent rounds of replication or
partial replication resulting in polynucleotides which share varying degrees of sequence
similarity or homology and which are chimeric with respect to the starting population of
DNA molecules. Optionally, the products or partial pools of the products can be amplified
at one or more stages in the process. Polynucleotides produced by a chain termination
method, such as described above, are suitable substrates for diversity generation methods
(e.g., RSR, DNA shuffling) according to any of the described formats.

Diversity can be further increased by using methods which are not
homology based together with DNA shuffling (which, as set forth in the above publications
and applications, can be homology or non-homology based, depending on the precise format).
For example, incremental truncation for the creation of hybrid enzymes (ITCHY),
described in Ostermeier et al. (1999) "A combinatorial approach to hybrid enzymes
independent of DNA homology" Nature Biotech. 17:1205, can be used to generate an
initial recombinant library which serves as a substrate for one or more rounds of in vitro,
ex vivo, or in vivo diversity generation methods (e.g., RSR or shuffling methods).

Methods for generating multispecies expression libraries have been
described (e.g., U.S. Patent Nos. 5,783,431; 5,824,485 and the references above) and their
use to identify protein activities of interest has been proposed (U.S. Patent 5,958,672 and
the references above). Multispecies expression libraries are, in general, libraries
comprising cDNA or genomic sequences from a plurality of species or strains, operably
linked to appropriate regulatory sequences, in an expression cassette. The cDNA and/or
genomic sequences are optionally randomly concatenated to further enhance diversity.
The vector can be a shuttle vector suitable for transformation and expression in more than
one species of host organism, e.g., bacterial species, eukaryotic cells. In some cases, the
library is biased by preselecting sequences which encode a protein of interest, or which
hybridize to a nucleic acid of interest. Any such libraries can be provided as substrates for
any of the methods herein described.

In some applications, it is desirable to preselect or prescreen libraries (e.g.,
an amplified library, a genomic library, a cDNA library, a normalized library, etc.) or
other substrate nucleic acids prior to shuffling, or to otherwise bias the substrates towards
nucleic acids that encode functional products (shuffling procedures can also,
independently, have these effects). For example, in the case of antibody engineering, it is
possible to bias the shuffling process toward antibodies with functional antigen binding
sites by taking advantage of in vivo (or ex vivo or in vitro) recombination events prior to
diversity generation (e.g., DNA shuffling) by any described method. For example,
recombined CDRs derived from B cell cDNA libraries can be amplified and assembled
into framework regions (e.g., Jirholt et al. (1998) "Exploiting sequence space: shuffling in
vivo formed complementarity determining regions into a master framework," Gene
215:471) prior to diversity generation (e.g., DNA shuffling) according to any of the
methods described herein.

Libraries can be biased towards nucleic acids which encode proteins with
desirable activities (e.g., binding affinities, enzymatic activities, anti-viral activities,
ability to induce an immune response, antiproliferative activities, adjuvant properties,
etc.). For example, after identifying a clone from a library which exhibits a specified
activity, the clone can be mutagenized using any known method for introducing DNA
alterations, including, but not restricted to, DNA shuffling or another form of recursive
sequence recombination or diversity generation. A library comprising the mutagenized
homologues is then screened for a desired activity, which can be the same as or different
from the initially specified activity. An example of such a procedure is proposed in U.S.
Patent No. 5,939,250. Desired activities can be identified by any method known in the art.
For example, WO 99/10539 proposes that gene libraries can be screened by combining
extracts from the gene library with components obtained from metabolically rich cells and
identifying combinations which exhibit the desired activity. It has also been proposed
(e.g., WO 98/58085) that clones with desired activities can be identified by inserting
bioactive substrates into samples of the library, and detecting bioactive fluorescence
corresponding to the product of a desired activity using a fluorescent analyzer, e.g., a flow
cytometry device, a CCD, a fluorometer, or a spectrophotometer.

Libraries can also be biased towards nucleic acids which have specified
characteristics, e.g., hybridization to a selected nucleic acid probe. For example,
application WO 99/10539 proposes that polynucleotides encoding a desired activity (e.g.,
an enzymatic activity, for example: a lipase, an esterase, a protease, a glycosidase, a
glycosyl transferase, a phosphatase, a kinase, an oxygenase, a peroxidase, a hydrolase, a
hydratase, a nitrilase, a transaminase, an amidase or an acylase) can be identified from
among genomic DNA sequences in the following manner. Single stranded DNA
molecules from a population of genomic DNA are hybridized to a ligand-conjugated
probe. The genomic DNA can be derived from either a cultivated or uncultivated
microorganism, or from an environmental sample. Alternatively, the genomic DNA can be
derived from a multicellular organism, or a tissue derived therefrom.

Second strand synthesis can be conducted directly from the hybridization
probe used in the capture, with or without prior release from the capture medium, or by a
wide variety of other strategies known in the art. Alternatively, the isolated single-stranded
genomic DNA population can be fragmented without further cloning and used
directly in a shuffling-based gene reassembly process. In one such method the fragment
population derived from the genomic library(ies) is annealed with partial or, often,
approximately full length ssDNA or RNA corresponding to the opposite strand. Assembly
of complex chimeric genes from this population is then mediated by nuclease-based removal
of non-hybridizing fragment ends, polymerization to fill gaps between such fragments and
subsequent single stranded ligation. The parental strand can be removed by digestion (if
RNA or uracil-containing), magnetic separation under denaturing conditions (if labeled in
a manner conducive to such separation) and other available separation/purification
methods. Alternatively, the parental strand is optionally co-purified with the chimeric
strands and removed during subsequent screening and processing steps. As set forth in
"Single-stranded nucleic acid template-mediated recombination and nucleic acid fragment
isolation" by Affholter (USSN 60/186,482, filed March 2, 2000) and WO 98/27230,
"Methods and Compositions for Polypeptide Engineering" by Patten and Stemmer,
shuffling using single-stranded templates and nucleic acids of interest which bind to a
portion of the template can also be performed.

In one approach, single-stranded molecules are converted to double-stranded
DNA (dsDNA) and the dsDNA molecules are bound to a solid support by ligand-mediated
binding. After separation of unbound DNA, the selected DNA molecules are
released from the support and introduced into a suitable host cell to generate a library
enriched for sequences which hybridize to the probe. A library produced in this manner
provides a desirable substrate for any of the shuffling reactions described herein.

"Non-Stochastic" methods of generating nucleic acids and polypeptides are 
alleged in Short, J. "Non-Stochastic Generation of Genetic Vaccines and Enzymes" WO
00/46344. These methods, including the proposed non-stochastic polynucleotide
reassembly and gene site saturation mutagenesis and synthetic ligation polynucleotide
reassembly methods outlined therein, can be applied to the present invention as well.

It will readily be appreciated that any of the above described techniques
suitable for enriching a library prior to diversification can also be used to screen the
products, or libraries of products, produced by the diversity generating methods.

A recombinant nucleic acid produced by recursively recombining one or
more polynucleotides of the invention with one or more additional nucleic acids also
forms a part of the invention. The one or more additional nucleic acids may include
another polynucleotide of the invention; optionally, alternatively, or in addition, the one or
more additional nucleic acids can include, e.g., a nucleic acid encoding a naturally-occurring
interferon-alpha or a subsequence thereof, or any homologous interferon-alpha
sequence or subsequence thereof, or an interferon-beta sequence or subsequence thereof
(e.g., an interferon-alpha or interferon-beta sequence as found in GenBank or other
available literature), or, e.g., any other homologous or non-homologous nucleic acid
(certain recombination formats noted above, notably those performed synthetically or in
silico, do not require homology for recombination).

The recombining steps may be performed in vivo, ex vivo, in vitro, or in
silico as described in more detail in the references above. Also included in the invention
is a cell containing any resulting recombinant nucleic acid, nucleic acid libraries produced
by diversity generation, recombination, or recursive recombination of the nucleic acids set
forth herein, and populations of cells, vectors, viruses, plasmids or the like comprising the
library or comprising any recombinant nucleic acid resulting from diversity generation or
recombination (or recursive recombination) of a nucleic acid as set forth herein with
another such nucleic acid, or an additional nucleic acid. Corresponding sequence strings
in a database present in a computer system or computer readable medium are a feature of
the invention.

OTHER POLYNUCLEOTIDE COMPOSITIONS

The invention also includes compositions comprising two or more
polynucleotides of the invention (e.g., as substrates for recombination). The composition
can comprise a library of recombinant nucleic acids, where the library contains at least 2,

occurring interferon-alphas is readily performed by one of ordinary skill in the art using
publicly available databases and alignment programs.

The invention also provides a polypeptide comprising at least about 100,
120, 130, 140, 150, 155, 160, 163, 165, or 166 contiguous amino acids of any one of SEQ
ID NOS:36-70 or SEQ ID NO:71. In one aspect, said amino acid sequence comprises
amino acids Lys160 and Glu166, wherein the numbering of the amino acids in the
sequence corresponds to that of SEQ ID NO:36.

Several conclusions may be drawn from comparison of the exemplary
sequences of the invention (Fig. 1) to sequences of known, naturally-occurring
interferon-alphas and other Type I interferons (including beta, delta, omega, and tau-interferons)
from human and non-human sources. Such sequences are readily available from a variety
of sources, such as GenBank, and the Pfam (Protein Families) database at
http://www.sanger.ac.uk/Software/Pfam/index.shtml.

Of particular note is the presence, in some interferon homologue
polypeptide sequences of the invention, of the following amino acid residues (denoted
"Group I" residues) which do not appear in the equivalent position of known,
naturally-occurring human or non-human Type I interferon sequences.

Group I: Asp11; Pro14; Arg50; Phe55; Asp75; Asn80; Pro111; Leu124;
Glu134; Ser140; and Ala143; with residue numbering corresponding to the mature
interferon homologue sequence identified as SEQ ID NO:36.

Also of note is the presence, in some interferon homologue polypeptide
sequences of the invention, of the following amino acid residues (denoted "Group II"
residues) which do not appear in the equivalent position of known, naturally-occurring
human interferon-alpha subtype sequences.

Group II: Pro9; (Lys, Ser)12; (Thr, Val)24; Gln34; Arg40; Ser45; Arg47;
Leu56; Ile60; Phe67; Ala79; Gly88; His90; Arg91; Glu95; Val101; (Gly, Ala)104;
Val112; Gly114; Pro116; Lys133; and His136.

In other embodiments, the interferon homologue polypeptide comprises at
least 20, 50, 100, 150, 155, or 160 or more contiguous amino acids of any one of SEQ ID
NOS:36-70 and/or one or more of amino acids Ala19, (Tyr or Gln)34, Gly37, Phe38,
Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and Glu166, wherein the
numbering of the amino acids corresponds to that of SEQ ID NO:36, or one or more of
amino acids Pro9, (Lys or Ser)12, (Thr or Val)24, Gln34, Arg40, Ser45, Arg47, Leu56,
Ile60, Phe67, Ala79, Gly88, His90, Arg91, Glu95, Val101, (Gly, Ala)104, Val112,
Gly114, Pro116, Lys133, and His136, wherein the numbering of the amino acids in said
polypeptide sequence corresponds to the numbering of individual amino acids in the
amino acid sequence of SEQ ID NO:36. Thus, for example, in this embodiment, an
interferon polypeptide comprises an amino acid sequence comprising a proline residue at
amino acid position 9 in the sequence, a lysine or serine residue at position 12, a threonine
or valine residue at position 24, a glutamine residue at position 34, an arginine residue at
position 40, etc. Such polypeptides may exhibit antiproliferative activities in a human
Daudi cell line-based proliferation assay (e.g., at least about 8.3x10^ units/mg) and/or
antiviral activities in a human WISH cell/EMCV-based assay (at least about 2.1x10^
units/mg). Some such polypeptides bind a human alpha interferon receptor. Some such
polypeptides are 166 amino acids in length. In another aspect, such polypeptides may
comprise a sequence selected from any of the group of SEQ ID NO:36 to SEQ ID NO:54.
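
The position-specific residue test described above (a proline at position 9, a lysine or serine at position 12, and so on) can be sketched in a few lines of Python. This is an illustration only: the dictionary holds just the five positions spelled out in the example sentence, the names EXAMPLE_RESIDUES and matching_positions are hypothetical, and the input is assumed to be a mature polypeptide whose numbering already corresponds to that of SEQ ID NO:36 (no alignment is performed).

```python
# Illustrative subset of the positions named in the text, in one-letter code:
# Pro9, (Lys or Ser)12, (Thr or Val)24, Gln34, Arg40.
EXAMPLE_RESIDUES = {9: "P", 12: "KS", 24: "TV", 34: "Q", 40: "R"}


def matching_positions(sequence, allowed=EXAMPLE_RESIDUES):
    """Return the 1-based positions at which `sequence` carries an allowed residue.

    `sequence` is assumed to be numbered like the reference sequence, so that
    position n of the reference is sequence[n - 1].
    """
    hits = []
    for position in sorted(allowed):
        in_range = position <= len(sequence)
        if in_range and sequence[position - 1] in allowed[position]:
            hits.append(position)
    return hits
```

A polypeptide "comprising one or more" of the listed residues corresponds to a non-empty return value; requiring a particular combination corresponds to checking that specific positions appear in the returned list.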

An antiproliferative activity of any polypeptide of the invention generally
relates to the capability or ability of a polypeptide to inhibit the capacity of cells or parts
thereof to grow or produce new cellular growth rapidly and often repeatedly.

The invention further includes a polypeptide (e.g., any of SEQ ID NOS:36-71
or SEQ ID NOS:79-85) or a nucleic acid (e.g., any of SEQ ID NOS:1-35 or SEQ ID
NOS:72-78) encoding a polypeptide, wherein said polypeptide has an anti-angiogenic
activity as measured by an anti-angiogenesis assay well known to those of ordinary skill in
the art.

The invention further includes: 

(a) any interferon-alpha polypeptide comprising one or more Group I
amino acid residues above;

(b) any interferon-alpha polypeptide comprising one or more Group II
amino acid residues above in the context of a human-like interferon sequence (i.e., a
sequence which displays a high level of similarity or homology to a human interferon), or
a sequence which is highly similar or homologous (i.e., having a percent sequence
homology or sequence identity of at least about 80%, 90%, 95%, 96%, 97%, 98% or
more) to any sequence listed in the attached sequence listing or fragment thereof;

(c) any interferon-alpha polypeptide containing a combination of the
following residues, which are localized in or near the regions of the interferon-alpha
molecule known or proposed to interact with a Type I interferon receptor, where such
sequence combinations (motifs) do not appear in the equivalent position of any known
naturally-occurring human or non-human Type I interferon:

(i) (Tyr or Gln)34; plus one or more of Ile132 or Arg134; or

(ii) Asp78, Glu79, or (Asp or Thr)80; plus one or more of Ile132 or
Arg134.

In another embodiment, the present invention provides an interferon alpha
homologue comprising the sequence shown in SEQ ID NO:71:

CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26-FSCLKDR-X34-DFG-X38-P-
X40-EEFD-X45-X46-X47-FQ-X50-X51-QAI-X55-X56-X57-HE-X60-X61-QQTFN-X67-FSTK-X72-SS-
X75-X76-W-X78-X79-X80-LL-X83-K-X85-X86-T-X88-L-X90-QQLN-X95-LEACV-X101-Q-X103-V-
X105-X106-X107-X108-TPLMN-X114-D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-
KYSPC-X140-WEVVRAEIMRSFSFSTNLQKRLRRKE,

or a conservatively substituted variation thereof, where X11 is N or D; X12 is R, S, or K;
X15 is L or M; X16 is I, M, or V; X19 is A or G; X22 is G or R; X24 is I or T; X26 is P or H;
X34 is H, Y or Q; X38 is F or L; X40 is Q or R; X45 is G or S; X46 is N or H; X47 is Q or R;
X50 is K or R; X51 is A or T; X55 is S or F; X56 is V or A; X57 is L or F; X60 is M or I;
X61 is I or M; X67 is L or F; X72 is D or N; X75 is A or V; X76 is A or T; X78 is E or D;
X79 is Q or E; X80 is S, R, T, or N; X83 is E or D; X85 is F or L; X86 is S or Y; X88 is E
or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M, or V; X103 is E or G; X105 is G
or W; X106 is V or M; X107 is E, G, or K; X108 is E or G; X114 is V, E, or G; X116 is S or
P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K or R; and X140 is A or S;
or a fragment of said SEQ ID NO:71. In another aspect, the
interferon homologue polypeptide of SEQ ID NO:71, or a fragment thereof, exhibits an
antiproliferative activity in a human Daudi cell line-based proliferation assay (at least
about 8.3x10^ units/mg) and/or an antiviral activity in a human WISH cell/EMCV-based
assay (at least about 2.1x10^ units/mg). Both such assays are discussed in greater detail
below. Such polypeptide may comprise an amino acid sequence of the group of from SEQ
ID NO:36 to SEQ ID NO:54 or may be encoded by a nucleotide sequence of the group of
from SEQ ID NO:1 to SEQ ID NO:19.
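
A degenerate sequence of this kind, with fixed residues interleaved with positions that admit a small set of alternatives, maps naturally onto a regular expression. The sketch below covers only the first 26 positions and assumes the variable positions read as X11 = N or D, X12 = R, S, or K, X15 = L or M, X16 = I, M, or V, X19 = A or G, X22 = G or R, X24 = I or T, and X26 = P or H; the sequence listing itself remains the authoritative source. The names N_TERMINAL_TOKENS, tokens_to_regex, and matches_n_terminus are hypothetical.

```python
import re

# Fixed runs are plain strings; variable positions are sets of allowed residues.
# Only positions 1-26 of the consensus are encoded here, as an illustration.
N_TERMINAL_TOKENS = [
    "CDLPQTHSLG",        # positions 1-10
    {"N", "D"},          # X11
    {"R", "S", "K"},     # X12
    "RA",                # positions 13-14
    {"L", "M"},          # X15
    {"I", "M", "V"},     # X16
    "LL",                # positions 17-18
    {"A", "G"},          # X19
    "QM",                # positions 20-21
    {"G", "R"},          # X22
    "R",                 # position 23
    {"I", "T"},          # X24
    "S",                 # position 25
    {"P", "H"},          # X26
]


def tokens_to_regex(tokens):
    """Compile a token list into a regular expression anchored by match()."""
    parts = []
    for token in tokens:
        if isinstance(token, str):
            parts.append(re.escape(token))
        else:
            parts.append("[" + "".join(sorted(token)) + "]")
    return re.compile("".join(parts))


N_TERMINAL_PATTERN = tokens_to_regex(N_TERMINAL_TOKENS)


def matches_n_terminus(sequence):
    """True if the sequence starts with a string allowed by the 26-position pattern."""
    return N_TERMINAL_PATTERN.match(sequence) is not None
```

This tests exact membership in the degenerate family only; it does not account for the "conservatively substituted variation" or fragment language, which would need a separate, more permissive comparison.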
Fragments of the interferon homologue polypeptides described herein are
also a feature of the invention. An interferon alpha homologue fragment of the invention
typically comprises an interferon homologue polypeptide comprising at least about 20, 25,
or 30, and typically at least about 40, 50, 60, 70, 80, 90, or 100 contiguous amino acids of
any one of SEQ ID NOS:36-71 or SEQ ID NOS:79-85. In other embodiments, the
fragment comprises usually at least about 100, 110, 120, 125, 130, 140, 150, 155, 158,
160, 162, 163, 164, or 165 contiguous amino acids of any one of SEQ ID NOS:36-71 or
SEQ ID NOS:79-85. Such polypeptide fragments may have an antiproliferative activity in
a human Daudi cell line-based assay and/or an antiviral activity in a human or murine cell
line/EMCV-based assay.
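
Whether a candidate polypeptide contains "at least about N contiguous amino acids" of a reference sequence can be checked by finding the longest run of residues the two sequences share. The following is a minimal sketch using the standard longest-common-substring recurrence; the function names are hypothetical and exact residue identity is assumed.

```python
def longest_shared_run(candidate, reference):
    """Length of the longest contiguous stretch present in both sequences."""
    best = 0
    previous = [0] * (len(reference) + 1)
    for i in range(1, len(candidate) + 1):
        current = [0] * (len(reference) + 1)
        for j in range(1, len(reference) + 1):
            if candidate[i - 1] == reference[j - 1]:
                # Extend the run that ended at the previous residue of each sequence.
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def has_contiguous_fragment(candidate, reference, minimum_length):
    """True if candidate shares a run of at least `minimum_length` residues."""
    return longest_shared_run(candidate, reference) >= minimum_length
```

For sequences of about 166 residues the quadratic cost of this comparison is negligible.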

In other embodiments, the invention provides polypeptides having a length
of 166 amino acids, and, in some such embodiments, such polypeptides have an
antiproliferative activity in a human Daudi cell line-based assay (or other similar assay),
including, e.g., at least about 8.3x10^ units/mg, and/or an antiviral activity in a human
WISH cell line/EMCV-based assay (or other similar assay), including, e.g., at least about
2.1x10^ units/mg.

In other embodiments, the invention provides a polypeptide comprising at
least 100, 150, 155, or 160 contiguous amino acids of a protein encoded by a coding
polynucleotide sequence comprising any of the following: (a) SEQ ID NO:1 to SEQ ID
NO:35 or SEQ ID NO:72 to SEQ ID NO:78; (b) a coding polynucleotide sequence that
encodes a first polypeptide selected from any of SEQ ID NO:36 to SEQ ID NO:70 or SEQ
ID NO:79 to SEQ ID NO:85; and (c) a complementary polynucleotide sequence that
hybridizes under at least highly stringent (or ultra-high stringent or ultra-ultra-high
stringent) hybridization conditions over substantially the entire length of a
polynucleotide sequence of (a) or (b). Such polypeptides may have an antiproliferative
activity in a human Daudi cell line-based assay (or other similar assay), and/or an antiviral
activity in a human WISH cell line/EMCV-based assay (or other similar assay). Some
such polypeptides of the invention specifically bind a human alpha interferon receptor.
The polypeptides and nucleic acids of the subject invention need not be identical, but can
be substantially identical, to the corresponding sequence of the target molecule or related
molecule, including the polypeptides of any of SEQ ID NOS:36-71 or fragments thereof
(including those having antiviral or antiproliferative activities in the assays described
herein), or the nucleic acids of any of SEQ ID NOS:1-35 or fragments thereof (including
those having antiviral or antiproliferative activities in the assays described herein). The
polypeptides can be subject to various changes, such as insertions, deletions, and
substitutions, either conservative or non-conservative, where such changes might provide
for certain advantages in their use. The polypeptides of the invention can be modified in a
number of ways so long as they comprise a sequence substantially identical (as defined
below) or having a percent identity to a sequence in the naturally occurring or known
interferon polypeptide molecule.

Alignment and comparison of relatively short amino acid sequences (less
than about 30 residues) is typically straightforward. Comparison of longer sequences can
require more sophisticated methods to achieve optimal alignment of two sequences.
Optimal alignment of sequences for aligning a comparison window can be conducted by
the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2:482, by
the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol.
48:443, by the search for similarity method of Pearson and Lipman (1988) Proc. Nat'l
Acad. Sci. (USA) 85:2444, by computerized implementations of these algorithms (GAP,
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release
7.0, Genetics Computer Group, 575 Science Dr., Madison, WI), or by inspection, and the
best alignment (i.e., resulting in the highest percentage of sequence similarity over the
comparison window) generated by the various methods is selected.
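
As an illustration of the global alignment approach of Needleman and Wunsch, the following Python sketch fills a dynamic programming matrix and traces back one optimal alignment. It is a teaching sketch, not the GAP implementation: the scoring values (match +1, mismatch -1, linear gap penalty -2) and the function name are assumptions chosen for readability, whereas production programs use substitution matrices and separate gap opening and extension penalties.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-2):
    """Return (score, aligned_a, aligned_b) for one optimal global alignment."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap

    def pair_score(i, j):
        return match if a[i - 1] == b[j - 1] else mismatch

    # Fill: each cell is the best of a diagonal step or a gap in either sequence.
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            score[i][j] = max(
                score[i - 1][j - 1] + pair_score(i, j),
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )

    # Traceback from the bottom-right corner, preferring diagonal moves.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + pair_score(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return score[n][m], "".join(reversed(out_a)), "".join(reversed(out_b))
```

The Smith-Waterman local method differs mainly in clamping each cell at zero and starting the traceback from the highest-scoring cell rather than the corner.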

The term "sequence identity" means that two polynucleotide sequences are
identical (i.e., on a nucleotide-by-nucleotide basis) over a window of comparison. The
term "percentage of sequence identity" or "percent sequence identity" is calculated by
comparing two optimally aligned sequences over the window of comparison, determining
the number of positions at which the identical residue occurs in both sequences to yield
the number of matched positions, dividing the number of matched positions by the total
number of positions in the window of comparison (i.e., the window size), and multiplying
the result by 100 to yield the percentage of sequence identity. In one aspect, the present
invention provides interferon homologue nucleic acids having at least about 80%, 85%,
90%, 95%, 96%, 97%, 98%, 99%, 99.5% or more percent sequence identity with the
nucleic acids of any of SEQ ID NOS:1-35 or SEQ ID NOS:72-78 or fragments thereof.
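
The percent sequence identity calculation defined above can be written directly as code. In this sketch the two inputs are assumed to be already optimally aligned and of equal length, with "-" marking gap positions, and the window of comparison is taken to be the full aligned length; the function name is hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent sequence identity over the window of comparison.

    matched positions / window size * 100, where a position counts as matched
    only if both sequences carry the same residue (a gap never matches).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("window of comparison is empty")
    matched = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-"
    )
    return 100.0 * matched / len(aligned_a)
```

For example, two aligned 8-nucleotide sequences differing at one position share 7 matched positions out of 8, i.e. 87.5 percent sequence identity.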
As applied to polypeptides, the term "substantial identity" means that two
peptide sequences, when optimally aligned, such as by the programs GAP or BESTFIT
using default gap weights (described in detail below), share at least about 80 percent
sequence identity, preferably at least about 90 percent sequence identity, more preferably
at least about 95 percent sequence identity or more (e.g., 97, 98, or 99 percent sequence
identity). Preferably, residue positions which are not identical differ by conservative
amino acid substitutions. Conservative amino acid substitutions refer to the
interchangeability of residues having similar side chains. For example, a group of amino
acids having aliphatic side chains is glycine, alanine, valine, leucine, and isoleucine; a
group of amino acids having aliphatic-hydroxyl side chains is serine and threonine; a
group of amino acids having amide-containing side chains is asparagine and glutamine; a
group of amino acids having aromatic side chains is phenylalanine, tyrosine, and
tryptophan; a group of amino acids having basic side chains is lysine, arginine, and
histidine; and a group of amino acids having sulfur-containing side chains is cysteine and
methionine. Preferred conservative amino acid substitution groups are: valine-leucine-isoleucine,
phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine-glutamine.
In one aspect, the present invention provides interferon homologue
polypeptides having at least about 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99%, 99.5% or
more percent sequence identity with the polypeptides of any of SEQ ID NOS:36-71 or
SEQ ID NOS:79-85 or fragments thereof.
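
The preferred conservative substitution groups listed above can be encoded as sets of one-letter residue codes and used to classify each aligned position as identical, conservatively substituted, or otherwise different. This is a minimal sketch; the names are hypothetical, and the inputs are assumed to be equal-length, gap-free aligned sequences.

```python
# Preferred conservative substitution groups from the text, in one-letter code.
CONSERVATIVE_GROUPS = [
    set("VLI"),  # valine-leucine-isoleucine
    set("FY"),   # phenylalanine-tyrosine
    set("KR"),   # lysine-arginine
    set("AV"),   # alanine-valine
    set("NQ"),   # asparagine-glutamine
]


def is_conservative(x, y):
    """True if x and y differ but fall in the same preferred group."""
    if x == y:
        return False
    return any(x in group and y in group for group in CONSERVATIVE_GROUPS)


def classify_positions(aligned_a, aligned_b):
    """Count identical, conservatively substituted, and other positions."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    counts = {"identical": 0, "conservative": 0, "other": 0}
    for x, y in zip(aligned_a, aligned_b):
        if x == y:
            counts["identical"] += 1
        elif is_conservative(x, y):
            counts["conservative"] += 1
        else:
            counts["other"] += 1
    return counts
```

Note that valine belongs to two groups, so alanine-valine and valine-leucine are both conservative under this list while alanine-leucine is not.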

A preferred example of an algorithm that is suitable for determining percent
sequence identity and sequence similarity is the FASTA algorithm, which is described in
Pearson, W.R. & Lipman, D.J., 1988, Proc. Nat'l Acad. Sci. USA 85: 2444. See also W.
R. Pearson, 1996, Methods Enzymol. 266: 227-258. Preferred parameters used in a
FASTA alignment of DNA sequences to calculate percent identity are optimized, BL50
Matrix 15: -5, k-tuple = 2; joining penalty = 40, optimization = 28; gap penalty -12, gap
length penalty = -2; and width = 16.

Another preferred example of algorithms that are suitable for determining
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0
algorithms, which are described in Altschul et al., 1990, J. Mol. Biol. 215: 403-410
and Altschul et al., 1997, Nuc. Acids Res. 25: 3389-3402, respectively. BLAST and BLAST
2.0 are used, with the parameters described herein, to determine percent sequence identity
for the nucleic acids and proteins of the invention. Software for performing BLAST
analyses is publicly available through the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring
sequence pairs (HSPs) by identifying short words of length W in the query sequence,
which either match or satisfy some positive-valued threshold score T when aligned with a
word of the same length in a database sequence. T is referred to as the neighborhood word
score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds
for initiating searches to find longer HSPs containing them. The word hits are extended in
both directions along each sequence for as far as the cumulative alignment score can be
increased. Cumulative scores are calculated using, for nucleotide sequences, the
parameters M (reward score for a pair of matching residues; always > 0) and N (penalty
score for mismatching residues; always < 0). For amino acid sequences, a scoring matrix
is used to calculate the cumulative score. Extension of the word hits in each direction is
halted when: the cumulative alignment score falls off by the quantity X from its maximum
achieved value; the cumulative score goes to zero or below, due to the accumulation of
one or more negative-scoring residue alignments; or the end of either sequence is reached.
The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the
alignment. The BLASTN program (for nucleotide sequences) uses as defaults a
wordlength (W) of 11, an expectation (E) of 10, M=5, N=-4 and a comparison of both
strands. For amino acid sequences, the BLASTP program uses as defaults a wordlength of
3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff &
Henikoff (1992) Proc. Nat'l Acad. Sci. U.S.A. 89: 10915) alignments (B) of 50,
expectation (E) of 10, M=5, N=-4, and a comparison of both strands.
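
The extension step and its three stopping conditions can be illustrated with a short Python sketch for nucleotide sequences. It is a simplification under stated assumptions, not the BLAST implementation: it extends in one direction only, without gaps, from a seed that is assumed to be an exact match, and the function name and the default X of 20 are illustrative choices. M=5 and N=-4 follow the BLASTN defaults given in the text.

```python
def extend_right(query, subject, q_start, s_start, seed_len, M=5, N=-4, X=20):
    """Extend an exact-match seed to the right without gaps.

    Returns (best_score, best_extension_length), where the extension length
    counts residues added beyond the seed at the point of maximum score.
    """
    score = seed_len * M          # the seed itself is assumed to match exactly
    best_score, best_len = score, 0
    qi, si = q_start + seed_len, s_start + seed_len
    steps = 0
    # Stop condition 3: the end of either sequence is reached.
    while qi + steps < len(query) and si + steps < len(subject):
        score += M if query[qi + steps] == subject[si + steps] else N
        steps += 1
        if score > best_score:
            best_score, best_len = score, steps
        # Stop condition 1: score has fallen off by X from its maximum.
        # Stop condition 2: cumulative score has gone to zero or below.
        if best_score - score >= X or score <= 0:
            break
    return best_score, best_len
```

A larger X lets the extension tolerate longer runs of mismatches before giving up, which increases sensitivity at the cost of speed, consistent with the role the text assigns to the W, T, and X parameters.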

The BLAST algorithm also performs a statistical analysis of the similarity
between two sequences (see, e.g., Karlin & Altschul (1993) Proc. Nat'l Acad. Sci. U.S.A.
90: 5873-5877). One measure of similarity provided by the BLAST algorithm is the
smallest sum probability (P(N)), which provides an indication of the probability by which
a match between two nucleotide or amino acid sequences would occur by chance. For
example, a nucleic acid is considered similar to a reference sequence if the smallest sum
probability in a comparison of the test nucleic acid to the reference nucleic acid is less
than about 0.2, more preferably less than about 0.01, and most preferably less than about
0.001.
Another example of a useful algorithm is PILEUP. PILEUP creates a
multiple sequence alignment from a group of related sequences using progressive,
pairwise alignments to show relationship and percent sequence identity. It also plots a tree
or dendrogram showing the clustering relationships used to create the alignment. PILEUP
uses a simplification of the progressive alignment method of Feng & Doolittle (1987) J.
Mol. Evol. 25: 351-360. The method used is similar to the method described by Higgins &
Sharp (1989) CABIOS 5: 151-153. The program can align up to 300 sequences, each of a
maximum length of 5,000 nucleotides or amino acids. The multiple alignment procedure
begins with the pairwise alignment of the two most similar sequences, producing a cluster
of two aligned sequences. This cluster is then aligned to the next most related sequence or
cluster of aligned sequences. Two clusters of sequences are aligned by a simple extension
of the pairwise alignment of two individual sequences. The final alignment is achieved by
a series of progressive, pairwise alignments. The program is run by designating specific
sequences and their amino acid or nucleotide coordinates for regions of sequence
comparison and by designating the program parameters. Using PILEUP, a reference
sequence is compared to other test sequences to determine the percent sequence identity
relationship using the following parameters: default gap weight (3.00), default gap length
weight (0.10), and weighted end gaps. PILEUP can be obtained from the GCG sequence
analysis software package, e.g., version 7.0 (Devereux et al. (1984) Nuc. Acids Res. 12:
387-395).
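
The order in which a progressive method brings sequences into the alignment (most similar pair first, then the next most related sequence) can be sketched as follows. This covers the ordering step only and is not PILEUP: the similarity measure is a toy fraction of identical positions between equal-length strings, standing in for the pairwise alignment scores a real program would compute, and each new sequence is compared to individual cluster members rather than aligned against the cluster as a whole. All names are hypothetical.

```python
from itertools import combinations


def fraction_identical(a, b):
    """Toy similarity: fraction of identical positions in equal-length strings."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def joining_order(sequences):
    """Return sequence names in the order a progressive method would add them.

    `sequences` maps a name to a sequence string. The two most similar
    sequences come first; each later step adds the remaining sequence that is
    most similar to any sequence already in the cluster.
    """
    names = list(sequences)
    if len(names) < 2:
        return names
    first_pair = max(
        combinations(names, 2),
        key=lambda pair: fraction_identical(sequences[pair[0]], sequences[pair[1]]),
    )
    order = list(first_pair)
    remaining = [name for name in names if name not in order]
    while remaining:
        next_name = max(
            remaining,
            key=lambda name: max(
                fraction_identical(sequences[name], sequences[member])
                for member in order
            ),
        )
        order.append(next_name)
        remaining.remove(next_name)
    return order
```

The resulting order corresponds to reading a dendrogram from its tightest cluster outward; the actual alignment of each added sequence to the growing cluster is the part this sketch leaves out.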

Another preferred example of an algorithm that is suitable for multiple
DNA and amino acid sequence alignments is the CLUSTALW program (Thompson, J. D.
et al. (1994) Nucl. Acids Res. 22: 4673-4680). ClustalW performs multiple pairwise
comparisons between groups of sequences and assembles them into a multiple alignment
based on homology. Gap open and Gap extension penalties were 10 and 0.05,
respectively. For amino acid alignments, the BLOSUM algorithm can be used as a protein
weight matrix (Henikoff and Henikoff (1992) Proc. Nat'l Acad. Sci. U.S.A. 89: 10915-
10919).

Making Polypeptides of the Invention 
Recombinant methods for producing and isolating interferon homologue
polypeptides of the invention are described above. In addition to recombinant production,
the polypeptides may be produced by direct peptide synthesis using solid-phase techniques
(cf. Stewart et al. (1969) Solid-Phase Peptide Synthesis, W.H. Freeman Co., San
Francisco; Merrifield, J. (1963) J. Am. Chem. Soc. 85:2149-2154). Peptide synthesis may
be performed using manual techniques or by automation. Automated synthesis may be
achieved, for example, using an Applied Biosystems 431A Peptide Synthesizer (Perkin
Elmer, Foster City, Calif.) in accordance with the instructions provided by the
manufacturer. For example, subsequences may be chemically synthesized separately and
combined using chemical methods to provide full-length interferon homologues.
Fragments of the interferon homologue polypeptides of the invention, as discussed in
greater detail above, are also a feature of the invention and may be synthesized by using
the procedures described above.

Polypeptides of the invention can be produced by introducing into a
population of cells a nucleic acid of the invention, wherein the nucleic acid is operatively
linked to a regulatory sequence effective to produce the encoded polypeptide, culturing the
cells in a culture medium to produce the polypeptide, and optionally isolating the
polypeptide from the cells or from the culture medium.

In another aspect, polypeptides of the invention can be produced by
introducing into a population of cells a recombinant expression vector comprising at least
one nucleic acid of the invention, wherein the at least one nucleic acid is operatively
linked to a regulatory sequence effective to produce the encoded polypeptide, culturing the
cells in a culture medium under suitable conditions to produce the polypeptide encoded by
the expression vector, and optionally isolating the polypeptide from the cells or from the
culture medium.

Using Polypeptides 
Antibodies 

In another aspect of the invention, an interferon homologue polypeptide of
the invention is used to produce antibodies which have, e.g., diagnostic, prophylactic and
therapeutic uses, e.g., related to the activity, distribution, and expression of interferon
homologues.

Antibodies to interferon homologues of the invention may be generated by
methods well known in the art. Such antibodies may include, but are not limited to,
polyclonal, monoclonal, chimeric, humanized, single chain, Fab fragments and fragments
produced by an Fab expression library. Antibodies, i.e., those which block receptor
binding, are especially preferred for therapeutic or prophylactic use.

Interferon homologue polypeptides for antibody induction do not require
biological activity; however, the polypeptide or oligopeptide must be antigenic. Peptides
used to induce specific antibodies may have an amino acid sequence consisting of at least
10 amino acids, preferably at least 15 or 20 amino acids. Short stretches of an interferon
homologue polypeptide may be fused with another protein, such as keyhole limpet
hemocyanin, and antibodies produced against the chimeric molecule.

Methods of producing polyclonal and monoclonal antibodies are known to
those of skill in the art, and many antibodies are available. See, e.g., Coligan (1991)
Current Protocols in Immunology, Wiley/Greene, NY; Harlow and Lane (1989)
Antibodies: A Laboratory Manual, Cold Spring Harbor Press, NY; Stites et al. (eds.) Basic
and Clinical Immunology (4th ed.) Lange Medical Publications, Los Altos, CA, and
references cited therein; Goding (1986) Monoclonal Antibodies: Principles and Practice
(2d ed.) Academic Press, New York, NY; and Kohler and Milstein (1975) Nature
256:495-497. Other suitable techniques for antibody preparation include selection of
libraries of recombinant antibodies in phage or similar vectors. See Huse et al. (1989)
Science 246:1275-1281; and Ward et al. (1989) Nature 341:544-546. Specific monoclonal
and polyclonal antibodies and antisera will usually bind with a Kd of at least about 0.1
µM, preferably at least about 0.01 µM or better, and most typically and preferably, 0.001
µM or better.

Detailed methods for preparation of chimeric (humanized) antibodies can
be found in U.S. Patent 5,482,856. Additional details on humanization and other antibody
production and engineering techniques can be found in Borrebaeck (ed.) (1995) Antibody
Engineering, 2nd Edition, Freeman and Company, NY (Borrebaeck); McCafferty et al.
(1996) Antibody Engineering, A Practical Approach, IRL at Oxford Press, Oxford,
England (McCafferty); and Paul (1995) Antibody Engineering Protocols, Humana Press,
Totowa, NJ (Paul).

In one useful embodiment, this invention provides for fully humanized
antibodies against the interferon homologues of the invention. Humanized antibodies are
especially desirable in applications where the antibodies are used as prophylactics and
therapeutics in vivo and ex vivo in human patients. Human antibodies consist of
characteristically human immunoglobulin sequences. The human antibodies of this
invention can be produced using a wide variety of methods (see, e.g., Larrick et al.,
U.S. Pat. No. 5,001,065, and Borrebaeck, McCafferty, and Paul, supra, for a review). In
one embodiment, the human antibodies of the present invention are produced initially in
trioma cells. Genes encoding the antibodies are then cloned and expressed in other cells,
such as nonhuman mammalian cells. The general approach for producing human
antibodies by trioma technology is described by Ostberg et al. (1983), Hybridoma 2:361-367;
Ostberg, U.S. Pat. No. 4,634,664; and Engelman et al., U.S. Pat. No. 4,634,666. The
antibody-producing cell lines obtained by this method are called triomas because they are
descended from three cells: two human and one mouse. Triomas have been found to
produce antibody more stably than ordinary hybridomas made from human cells.
Adjuvants 

In one aspect, the interferon homologue polypeptides of the present
invention or fragments thereof are useful as adjuvants to stimulate, enhance, potentiate, or
augment an immune response related to an antigen when administered together with the
antigen or after or before delivery of the antigen. In another aspect, the invention provides
methods for administering one or more of the polypeptides of the invention described
herein to a subject.

Therapeutic and Prophylactic Agents
As described in greater detail below, the interferon homologue
polypeptides of the present invention or fragments thereof are useful in the prophylactic
and/or therapeutic treatment of a variety of diseases, disorders, or medical conditions.

For example, the invention provides interferon-alpha homologue
polypeptides (and interferon-alpha homologue nucleic acids which encode such
polypeptides) that have both antiviral and antiproliferative activities in the assays
described herein. In one aspect, the invention provides interferon-alpha homologue
polypeptides (and interferon-alpha homologue nucleic acids which encode such
polypeptides) in which the ratio of antiviral activity to antiproliferative activity is greater
than that of other known interferon-alphas such as those listed in GenBank as noted
herein. Such polypeptides (and nucleic acids encoding them) are useful in the therapeutic
and/or prophylactic treatment of various diseases and disorders, such as, e.g., treatment
regimens for hepatitis B, hepatitis C, HIV, and HSV. In such treatment regimens, some
such polypeptides (and nucleic acids encoding them), such as interferon-alpha homologue
2BA8, offer significant advantages over known interferon-alpha compounds, since they
likely exhibit lower side effects upon administration than known interferon-alpha
compounds, such as interferon-alpha 2a, are of higher potency, and thus may require
lower dosing and cause fewer immunogenicity effects.

SEQUENCE VARIATIONS 

Conservatively Modified Variations

Interferon homologue polypeptides of the present invention include one or
more conservatively modified variations (or "conservative variations" or "conservative
substitutions") of the polypeptide sequences disclosed herein as SEQ ID NO:36 to SEQ ID
NO:70 and SEQ ID NO:79 to SEQ ID NO:85. Such conservatively modified variations
comprise substitutions, additions or deletions which alter, add or delete a single amino
acid or a small percentage of amino acids (typically less than about 5%, more typically
less than about 4%, 2%, or 1%) in any of SEQ ID NO:36 to SEQ ID NO:70 and SEQ ID
NO:79 to SEQ ID NO:85.

For example, a conservatively modified variation (e.g., deletion) of the 166
amino acid polypeptide identified herein as SEQ ID NO:36 has a length of at least about
157 or 158 amino acids, preferably at least about 159 or 160 amino acids, more preferably
at least about 162 or 163 amino acids, and still more preferably at least about 164 or 165
amino acids, corresponding to a deletion of less than about 5%, 4%, 2% or 1% of the
polypeptide sequence, respectively.
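The length ranges quoted above follow from simple arithmetic on the 166 amino acid length. The sketch below is only an illustration of that arithmetic; the function name is hypothetical, and rounding the remaining length both down and up is an assumed reading of "at least about 157 or 158" and the like.

```python
import math

FULL_LENGTH = 166  # length of the polypeptide of SEQ ID NO:36, as stated in the text


def length_bounds(percent_deleted: int) -> tuple:
    """Return the remaining length rounded down and up after deleting the given percentage."""
    remaining = FULL_LENGTH * (100 - percent_deleted) / 100
    return math.floor(remaining), math.ceil(remaining)


# 5%, 4%, 2% and 1% deletions, in the order given in the text.
bounds = {pct: length_bounds(pct) for pct in (5, 4, 2, 1)}

# Largest whole number of residues that is still under 5% of 166 (5% is 8.3 residues).
max_changed_residues = FULL_LENGTH * 5 // 100
```

Under these assumptions `bounds` reproduces the pairs 157/158, 159/160, 162/163 and 164/165, and `max_changed_residues` is 8.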

Another example of a conservatively modified variation (e.g., a
"conservatively substituted variation") of the polypeptide identified herein as SEQ ID
NO:36 will contain "conservative substitutions", according to the six substitution groups
set forth in Table 2 (supra), in up to about 8 residues (i.e., less than about 5%) of the 166
amino acid polypeptide.

The interferon homologue polypeptide sequences of the invention,
including conservatively substituted sequences, can be present as part of larger
polypeptide sequences such as those which occur upon the addition of one or more domains
for purification of the protein (e.g., poly His segments, FLAG epitope segments, etc.), e.g.,
where the additional functional domains have little or no effect on the activity of the
interferon-alpha portion of the protein, or where the additional domains can be removed
by post-synthesis processing steps such as by treatment with a protease.

In another embodiment, interferon homologue polypeptides of the present
invention comprise the following sequence, identified herein as SEQ ID NO:71:
CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26-FSCLKDR-X34-
DFG-X38-P-X40-EEFD-X45-X46-X47-FQ-X50-X51-QAI-X55-X56-X57-HE-X60-X61-QQTFN-
X67-FSTK-X72-SS-X75-X76-W-X78-X79-X80-LL-X83-K-X85-X86-T-X88-L-X90-QQLN-X95-
LEACV-X101-Q-X103-V-X105-X106-X107-X108-TPLMN-X114-D-X116-IL-AV-X121-KY-X124-
QRITLYL-X132-E-X134-KYSPC-X140-WEVVRAEIMRSFSFSTNLQKRLRRKE, or a
conservatively substituted variation thereof, where X11 is N or D; X12 is R, S, or K; X15 is
L or M; X16 is I, M, or V; X19 is A or G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is
H, Y or Q; X38 is F or L; X40 is Q or R; X45 is G or S; X46 is N or H; X47 is Q or R; X50 is
K or R; X51 is A or T; X55 is S or F; X56 is V or A; X57 is L or F; X60 is M or I; X61 is I or
M; X67 is L or F; X72 is D or N; X75 is A or V; X76 is A or T; X78 is E or D; X79 is Q or E;
X80 is S, R, T, or N; X83 is E or D; X85 is F or L; X86 is S or Y; X88 is E or G; X90 is Y, H,
or N; X95 is D, E, or N; X101 is I, M, or V; X103 is E or G; X105 is G or W; X106 is V or M;
X107 is E, G, or K; X108 is E or G; X114 is V, E, or G; X116 is S or P; X121 is K or R; X124 is
F or L; X132 is T, I, or M; X134 is K or R; and X140 is A or S; or a fragment of said SEQ ID
NO:71. As defined above, a conservatively modified variation of the sequence of SEQ ID
NO:71 can include up to a total of about 8 amino acid deletions, insertions, or conservative
substitutions in the 166 amino acid polypeptide, excluding the positions designated X in
SEQ ID NO:71, which correspond to the amino acid explicitly defined.
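The generic sequence of SEQ ID NO:71 can be expressed as a pattern and used to test whether a candidate 166 amino acid sequence falls within it. The sketch below is a minimal illustration under stated assumptions: the template string is a reading of the sequence given above (including N at position 66 and IL at positions 117-118, which are inferred from the position numbering), and the names TEMPLATE, ALLOWED, build_pattern, matches_seq_id_71 and first_choice_member are hypothetical. It checks exact membership only and does not handle conservatively substituted variations or fragments.

```python
import re

# Dash-delimited template: fixed residues plus numbered variable positions (assumed reading).
TEMPLATE = (
    "CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26-FSCLKDR-X34-"
    "DFG-X38-P-X40-EEFD-X45-X46-X47-FQ-X50-X51-QAI-X55-X56-X57-HE-X60-X61-QQTFN-"
    "X67-FSTK-X72-SS-X75-X76-W-X78-X79-X80-LL-X83-K-X85-X86-T-X88-L-X90-QQLN-X95-"
    "LEACV-X101-Q-X103-V-X105-X106-X107-X108-TPLMN-X114-D-X116-IL-AV-X121-KY-X124-"
    "QRITLYL-X132-E-X134-KYSPC-X140-WEVVRAEIMRSFSFSTNLQKRLRRKE"
)

# Residues permitted at each variable position, in the order listed in the text.
ALLOWED = {
    11: "ND", 12: "RSK", 15: "LM", 16: "IMV", 19: "AG", 22: "GR", 24: "IT",
    26: "PH", 34: "HYQ", 38: "FL", 40: "QR", 45: "GS", 46: "NH", 47: "QR",
    50: "KR", 51: "AT", 55: "SF", 56: "VA", 57: "LF", 60: "MI", 61: "IM",
    67: "LF", 72: "DN", 75: "AV", 76: "AT", 78: "ED", 79: "QE", 80: "SRTN",
    83: "ED", 85: "FL", 86: "SY", 88: "EG", 90: "YHN", 95: "DEN", 101: "IMV",
    103: "EG", 105: "GW", 106: "VM", 107: "EGK", 108: "EG", 114: "VEG",
    116: "SP", 121: "KR", 124: "FL", 132: "TIM", 134: "KR", 140: "AS",
}


def is_variable(token: str) -> bool:
    """True for tokens such as 'X11' that name a variable position."""
    return token[0] == "X" and token[1:].isdigit()


def build_pattern(template: str, allowed: dict):
    """Return (compiled regex, total length); verify each X index equals its position."""
    parts = []
    position = 0
    for token in template.split("-"):
        if is_variable(token):
            position += 1
            index = int(token[1:])
            if index != position:
                raise ValueError(f"X{index} falls at position {position}")
            parts.append("[" + allowed[index] + "]")
        else:
            position += len(token)
            parts.append(token)
    return re.compile("".join(parts)), position


PATTERN, LENGTH = build_pattern(TEMPLATE, ALLOWED)


def matches_seq_id_71(sequence: str) -> bool:
    """True if the whole sequence is one exact member of the generic sequence."""
    return PATTERN.fullmatch(sequence) is not None


def first_choice_member() -> str:
    """Build one member by taking the first listed residue at every variable position."""
    return "".join(
        ALLOWED[int(tok[1:])][0] if is_variable(tok) else tok
        for tok in TEMPLATE.split("-")
    )
```

The position check inside build_pattern is a useful consistency test on the reconstruction: every X subscript must equal its running position, and the total length must come to 166.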

As an example, if four conservative substitutions were localized in the
subsequence corresponding to amino acids 141-166 of SEQ ID NO:71, examples of
conservatively substituted variations of this subsequence,
WEVVR AEIMR SFSFS TNLQK RLRRKE, include:
WEVVR SEIMR SFSYS TNLQR RLRRKD and
WELVR AEIVR SFSFS TNLNK RLRKKE, and the like, where the conservative
substitutions are underlined.
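Because underlining does not survive in plain text, the substituted positions in the two example variants can be recovered by direct comparison with the parent subsequence. The sketch below does this and checks each change against six substitution groups. Table 2 is not reproduced in this section, so the groups used here (A/S/T; D/E; N/Q; R/K; I/L/M/V; F/Y/W) are an assumption based on the commonly used six-group scheme, and the function names are hypothetical.

```python
# Assumed six conservative substitution groups (Table 2 itself is not shown in this section).
GROUPS = ["AST", "DE", "NQ", "RK", "ILMV", "FYW"]

PARENT = "WEVVRAEIMRSFSFSTNLQKRLRRKE"      # amino acids 141-166 of SEQ ID NO:71
VARIANT_1 = "WEVVRSEIMRSFSYSTNLQRRLRRKD"
VARIANT_2 = "WELVRAEIVRSFSFSTNLNKRLRKKE"
FIRST_POSITION = 141


def substitutions(parent: str, variant: str, start: int = FIRST_POSITION):
    """List (position, parent residue, variant residue) wherever the two differ."""
    if len(parent) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        (start + offset, a, b)
        for offset, (a, b) in enumerate(zip(parent, variant))
        if a != b
    ]


def is_conservative(a: str, b: str) -> bool:
    """True if both residues belong to the same assumed substitution group."""
    return any(a in group and b in group for group in GROUPS)


changes_1 = substitutions(PARENT, VARIANT_1)
changes_2 = substitutions(PARENT, VARIANT_2)
```

With these inputs, `changes_1` is A146S, F154Y, K160R and E166D, and `changes_2` is V143L, M149V, Q159N and R164K; all eight fall within the assumed groups.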
A feature of the invention is an interferon homologue polypeptide
comprising at least about 20, usually at least about 25, typically at least about 30, 40, 50,
60, 70, 80, 90, or 100 contiguous amino acids of any one of SEQ ID NOS:36-71 or SEQ
ID NOS:79-85. In other embodiments, the polypeptide typically comprises at least about
100, 110, 120, 125, 130, 140, 150, 155, 158, 160, 163, 164, or 165 contiguous amino acids
of any one of SEQ ID NOS:36-70 or SEQ ID NOS:79-85.

In other embodiments, the interferon homologue polypeptide of the
invention comprises an amino acid sequence comprising one or more of amino acid
residues (Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152,
Lys160, and Glu166, wherein the numbering of the amino acids corresponds to the
numbering of amino acids in the amino acid sequence of SEQ ID NO:36. In a preferred
embodiment, the interferon homologue polypeptide comprises an amino acid sequence
comprising at least 150, 155, or 166 contiguous amino acid residues of any one of SEQ ID
NOS:36-70, further comprising Lys160 and Glu166, wherein the numbering of the amino
acids corresponds to the numbering of amino acids in the amino acid sequence of SEQ ID
NO:36. Some such polypeptides also exhibit an antiproliferative activity of at least about
8.3x10^ units/milligram in the human Daudi cell line-based assay, or an antiviral activity
of at least about 2.1x10^ units/milligram (mg) in the human WISH cell/EMCV-based
assay.

DEFINING POLYPEPTIDES BY IMMUNOREACTIVITY 

Because the polypeptides of the invention provide a variety of new
polypeptide sequences as compared to other alpha interferon homologues, the
polypeptides also provide new structural features which can be recognized, e.g., in
immunological assays. The generation of antisera which specifically bind the
polypeptides of the invention, as well as the polypeptides which are bound by such
antisera, are features of the invention.

The invention includes interferon-alpha homologue polypeptides that
specifically bind to or that are specifically immunoreactive with an antibody or antisera
generated against an immunogen comprising an amino acid sequence selected from one or
more of SEQ ID NO:36 to SEQ ID NO:70, SEQ ID NO:71, and SEQ ID NO:79 to SEQ
ID NO:85. To eliminate cross-reactivity with other interferon-alpha polypeptides, e.g.,
known interferon-alpha polypeptides, the antibody or antisera (or antiserum) is subtracted
with available known alpha interferons, such as those polypeptides encoded by nucleic
acids represented by GenBank accession numbers J00210 (alpha-D), J00207 (alpha-A),
X02958 (alpha-6), X02956 (alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545
(IFN-IB), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b),
V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1),
I01614, I01787, I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313
(alpha-Con1), or any other known interferon-alpha polypeptides (typically referred to as
the "control alpha interferon polypeptides"). Where the accession number corresponds to
a nucleic acid, a polypeptide encoded by the nucleic acid is generated and used for
antibody/antisera subtraction purposes. Where the nucleic acid corresponds to a
non-coding sequence, e.g., a pseudogene, an amino acid sequence which corresponds to the
reading frame of the nucleic acid is generated (e.g., synthetically), or is minimally
modified to include a start codon for recombinant production.

In one typical format, the immunoassay uses a polyclonal antiserum which
was raised against one or more polypeptides comprising one or more of the amino acid
sequences corresponding to one or more of: SEQ ID NO:36 to SEQ ID NO:70, SEQ ID
NO:71, and SEQ ID NO:79 to SEQ ID NO:85, or a substantial subsequence thereof (i.e.,
at least about 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, or 98% or more of the full
length sequence provided). The full set of potential polypeptide immunogens derived
from one or more of SEQ ID NO:36 to SEQ ID NO:70, SEQ ID NO:71, and SEQ ID
NO:79 to SEQ ID NO:85 are collectively referred to below as "the immunogenic
polypeptides." The resulting antisera is optionally selected to have low cross-reactivity
against the control alpha interferon polypeptides and/or other known interferon
polypeptides and any such cross-reactivity is removed by immunoabsorption with one or
more of the control alpha interferon polypeptides, prior to use of the polyclonal antiserum
in the immunoassay.

In order to produce antisera for use in an immunoassay, one or more of the
immunogenic polypeptides is produced and purified as described herein. For example,
recombinant protein may be produced in a mammalian cell line. An inbred strain of mice
(used in this assay because results are more reproducible due to the virtual genetic identity
of the mice) is immunized with the immunogenic polypeptide(s) in combination with a
standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization protocol
(see Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor
Publications, New York, for a standard description of antibody generation, immunoassay
formats and conditions that can be used to determine specific immunoreactivity).

ANTIPROLIFERATIVE PROPERTIES OF INTERFERON HOMOLOGUES 

The effect of interferon homologues on cellular growth was examined in a
human Daudi cell line-based assay as described in Example 1. Fig. 2 shows the
antiproliferative activity of exemplary interferon homologues of the invention comprising
amino acid sequences SEQ ID NO:36 to SEQ ID NO:54, in comparison to control
interferons, human IFN-alpha 2a and consensus human IFN-alpha (Con1). The graph
shows the number of Units of activity per milligram (mg) of interferon test sample (Y
axis) for a set of exemplary interferon alpha homologues, each of which is designated with
a name (clone name) on the X axis, compared with that of human IFN-alpha 2a and
consensus human IFN-alpha. These results indicate that compositions comprising an
interferon-alpha homologue of the present invention can be used in methods to inhibit or
reduce proliferation of tumor cells, including, but not limited to: human carcinoma cells,
hematopoietic cancer cells, human leukemia cells, human lymphoma cells, and human
melanoma cells. Inhibition can be performed in vitro (useful, e.g., in a variety of
proliferation assays), ex vivo or in vivo (useful, e.g., as a therapeutic or prophylactic
agent).

Interferon-alpha homologues of the present invention show diverse activity
patterns against a variety of cancer cell lines (see, e.g., Example 2). An in vitro cell line
screen (as described in, e.g., Monks, A. et al. (1991) J. Natl. Cancer Inst. 83:757-766) was
used to assay interferon-alpha homologues of the invention for selective growth inhibition
and/or cell killing of particular cancer cell lines. The human cancer cell lines screened
(see, e.g., Example 2, Table 3) include leukemias, melanomas, and cancers of the lung,
colon, brain, central nervous system, ovary, breast, prostate, and kidney.

Three activity parameters were determined in the cancer cell line screen: 1)
GI50 ("growth inhibition at 50%"), a measure of growth inhibition activity, is the
concentration of interferon test sample (IFN alpha homologue or control IFN alpha) at
which cell growth is inhibited by 50%, as measured by a 50% reduction in the net
protein/polypeptide increase in the interferon test sample as compared to that observed in
the control cells (no test sample) at the end of the incubation period; 2) TGI ("total growth
inhibition"), a measure of cytostatic activity, is the concentration of interferon test sample
at which cell growth of a particular cell line is totally inhibited, wherein the amount of
cellular protein at the end of the incubation period equals the amount of cellular protein at
the beginning of the incubation period; and 3) LC50, a measure of cytotoxic activity, is
the concentration of interferon test sample at which a 50% reduction in the measured
amount of cellular protein at the end of the incubation as compared to that at the beginning
of the incubation period is observed, indicating a net loss of cells following interferon test
sample addition. Further details of the assay and data analysis procedures are provided in
Example 2.
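The three parameters defined above can be read off a dose-response series by expressing each protein measurement as a percent growth value and locating where that value crosses 50 (GI50), 0 (TGI) and -50 (LC50). The sketch below is an illustration only and is not the data analysis procedure of Example 2: the percent growth formula follows the definitions in the preceding paragraph, while the log-linear interpolation between tested concentrations, the function names, and the numerical readings are all assumptions.

```python
import math


def percent_growth(t_zero: float, control: float, test: float) -> float:
    """Percent growth of treated cells from cellular protein measurements.

    t_zero: protein at the beginning of the incubation period
    control: protein in untreated control cells at the end of the incubation
    test: protein in treated cells at the end of the incubation
    """
    if test >= t_zero:
        # Net growth, scaled to the net increase seen in the control cells.
        return 100.0 * (test - t_zero) / (control - t_zero)
    # Net loss of cells, scaled to the starting amount of protein.
    return 100.0 * (test - t_zero) / t_zero


def crossing_concentration(concentrations, growth, level):
    """First concentration at which percent growth falls to `level`.

    Interpolates linearly in log10(concentration) between adjacent tested
    concentrations (an assumed choice); returns None if the level is not
    reached within the tested range.
    """
    points = list(zip(concentrations, growth))
    for (c_lo, g_lo), (c_hi, g_hi) in zip(points, points[1:]):
        if g_lo >= level >= g_hi and g_lo != g_hi:
            fraction = (g_lo - level) / (g_lo - g_hi)
            log_c = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None


# Hypothetical readings (arbitrary units), not data from the assays described here.
T_ZERO, CONTROL = 0.5, 1.5
CONCENTRATIONS = [1.0, 10.0, 100.0, 1000.0]
READINGS = [1.4, 1.0, 0.5, 0.25]

GROWTH = [percent_growth(T_ZERO, CONTROL, r) for r in READINGS]
gi50 = crossing_concentration(CONCENTRATIONS, GROWTH, 50.0)
tgi = crossing_concentration(CONCENTRATIONS, GROWTH, 0.0)
lc50 = crossing_concentration(CONCENTRATIONS, GROWTH, -50.0)
```

A return value of None corresponds to the situation described in the text in which a control interferon shows no measurable activity over the concentration range of the assay.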

The activity parameters of exemplary interferon-alpha homologue 3DA11
(SEQ ID NO:40) against a variety of cancer cell lines are shown in Figs. 3A, 3B, and 3C,
in comparison with the interferon-alpha Con1 and human interferon-alpha 2a controls.

With respect to growth inhibition activity, in particular, homologue 3DA11
and control interferon-alpha Con1 showed significant activity against most of the cell lines
tested, with the interferon-alpha Con1 exhibiting generally higher activity, and
interferon-alpha 2a generally exhibiting lower overall activity and in only a subset of the
cell lines (Fig. 3A).

In contrast, a pronounced difference was observed in the
cytotoxic and cytostatic activities of homologue 3DA11 in comparison to both
interferon-Con1 and human interferon-alpha 2a controls. In the concentration range tested,
homologue 3DA11 showed significant cytostatic activity against a population of cells of
eleven of the cell lines, while interferon-Con1 showed activity against only a population
of cells of one of the cell lines, against which homologue 3DA11 was also active (Fig.
3B). IFN-alpha 2a, on the other hand, was not active in this assay against any of the
tested cell lines. Homologue 3DA11 thus has a broader cytostatic activity profile than
consensus human interferon-alpha (Con1) and human interferon-alpha 2a.

Homologue 3DA11 also showed significant cytotoxic activity in
comparison to the interferon-Con1 and human interferon-alpha 2a controls (Fig. 3C).
Surprisingly, homologue 3DA11 displayed cytotoxic activity against a population of cells
of 8 of the cell lines, whereas neither the interferon-Con1 nor the interferon-alpha 2a
controls exhibited measurable activity against a population of cells of any of the cell lines
at the concentration range employed in the assay. Thus, homologue 3DA11 also has a
broader cytotoxic activity profile than interferon-Con1 and human interferon-alpha 2a.

Figs. 4A-4D illustrate the cytostatic activity (as reflected by the TGI value)
of exemplary interferon-alpha homologues of the invention. In each figure, the relative
cytostatic activity (expressed as -log TGI) against a population of cells of a particular
cancer cell line is plotted for various interferon-alpha homologues and for the two control
interferons (interferon-Con1 and human interferon-alpha 2a).

Of the exemplary homologues tested, homologues 1D3 (SEQ ID NO:54)
and 3DA11 (SEQ ID NO:40), but neither of the control interferons, exhibited significant
cytostatic activity against a population of cells of leukemia cell line RPMI-8226 over the
concentration range of the assay (Fig. 4A). In this example, the 1D3 and 3DA11
homologues showed at least about 25-fold higher cytostatic activity against a population
of the cells (corresponding to a difference in TGI of at least about 1.4 log units) than did
either of the controls (interferon-Con1 or interferon-alpha 2a) against a population of cells
of the leukemia cell line.

Homologues 1D3, 2G5 (SEQ ID NO:45), 6CG3 (SEQ ID NO:52) and
3DA11, but neither of the control interferons, exhibited significant cytostatic activity
against lung cancer cell line NCI-H23 (Fig. 4B). In this example, the 1D3, 2G5, 6CG3,
and 3DA11 homologues showed at least about 12-fold higher cytostatic activity against a
population of cells of a lung cancer cell line (corresponding to a difference in TGI of at
least about 1.1 log units) than either interferon-Con1 or interferon-alpha 2a against a
population of cells of the lung cancer cell line.

Homologues 1D3, 2G5, and 3DA11, but neither of the control interferons,
showed significant cytostatic activity against a population of cells of renal cancer cell line
ACHN (Fig. 4C). In this example, the 1D3, 2G5, and 3DA11 homologues showed at least
about 35-fold higher cytostatic activity against a population of cells of said renal cancer
cell line (corresponding to a difference in TGI of at least about 1.55 log units) than either
interferon-Con1 or interferon-alpha 2a against a population of cells of the renal cancer cell
line.

Homologues 1D3, 2G5, 3DA11, 2CA5 (SEQ ID NO:42) and 2DB11 (SEQ
ID NO:41), and the interferon-Con1 control, but not the interferon alpha-2a control,
exhibited significant cytostatic activity against a population of cells of an ovarian cancer
cell line OVCAR-3 (Fig. 4D). In this example, homologue 1D3 showed at least about
2-fold higher cytostatic activity (corresponding to a difference in TGI of at least about 0.3
log units) than interferon-Con1, and the 1D3, 2G5, 3DA11, 2CA5, and 2DB11
homologues showed at least about 40-fold higher cytostatic activity (corresponding to a
difference in TGI of at least about 1.6 log units) than interferon-alpha 2a, against
respective populations of cells of the ovarian cancer cell line.
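The fold differences quoted for Figs. 4A-4D follow from the log-unit differences in TGI, since a difference of d units in -log TGI corresponds to a 10^d-fold difference in TGI. The short sketch below simply evaluates that relationship for the values given in the text; the function name is hypothetical and a base-10 logarithm is assumed.

```python
def fold_difference(log_units: float) -> float:
    """Fold difference in TGI corresponding to a difference in -log10(TGI)."""
    return 10 ** log_units


# Log-unit differences quoted in the text, keyed by the cell line discussed.
QUOTED = {
    "RPMI-8226 (leukemia)": 1.4,
    "NCI-H23 (lung)": 1.1,
    "ACHN (renal)": 1.55,
    "OVCAR-3 vs Con1 (ovarian)": 0.3,
    "OVCAR-3 vs alpha 2a (ovarian)": 1.6,
}

FOLD = {name: round(fold_difference(d), 1) for name, d in QUOTED.items()}
```

The computed values (about 25.1, 12.6, 35.5, 2.0 and 39.8) agree with the approximate 25-fold, 12-fold, 35-fold, 2-fold and 40-fold figures stated above.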

From the exemplary data provided herein, it is apparent that
interferon-alpha homologues of the invention showed a variety of cytostatic activity
profiles, which differed significantly from those of the interferon-alpha Con1 and
interferon alpha-2a.

The present invention includes an interferon-alpha homologue having
increased cytostatic activity relative to human interferon-alpha 2a or to consensus human
interferon-alpha, Con1. In various embodiments, the interferon-alpha homologue has at
least about 2-fold higher cytostatic activity against a population of cells of a cancer cell
line (i.e., has a TGI value at least about 2-fold lower) than does human interferon-alpha 2a,
or has at least 2-fold higher cytostatic activity than interferon-Con1, against a population
of cells of one or more cancer cell lines selected from the following: a leukemia cell line; a
melanoma cell line; a lung cancer cell line; a colon cancer cell line; a central nervous
system (CNS) cancer cell line; an ovarian cancer cell line; a breast cancer cell line; a
prostate cancer cell line; and a renal cancer cell line.

In other embodiments, the interferon-alpha homologue has at least about
5-fold higher cytostatic activity against a population of cells of a cancer cell line (i.e., has
a TGI value at least about 5-fold lower) than does human interferon-alpha 2a, or has at least
about 5-fold higher cytostatic activity than interferon-Con1, against a population of cells
of one or more cancer cell lines selected from the following: a leukemia cell line; a
melanoma cell line; a lung cancer cell line; a colon cancer cell line; a central nervous
system (CNS) cancer cell line; an ovarian cancer cell line; a breast cancer cell line; a
prostate cancer cell line; and a renal cancer cell line. In other embodiments, the
interferon-alpha homologue has at least about 10-fold higher cytostatic activity against a
population of cells of a cancer cell line (i.e., has a TGI value at least about 10-fold lower)
than does human interferon-alpha 2a, or has at least about 10-fold higher cytostatic
activity than interferon-Con1, against a population of cells of one or more cancer cell lines
selected from the following: a leukemia cell line; a melanoma cell line; a lung cancer cell
line; a colon cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a breast
cancer cell line; a prostate cancer cell line; and a renal cancer cell line.

The invention includes an interferon-alpha homologue having increased
cytotoxic activity relative to human interferon-alpha 2a or relative to interferon-Con1. In
various embodiments, the interferon-alpha homologue has at least about 2-fold higher
cytotoxic activity (i.e., has an LC50 value at least about 2-fold lower), at least 5-fold
higher cytotoxic activity, or at least 10-fold higher cytotoxic activity, than human
interferon-alpha 2a against a population of cells of one or more cancer cell lines selected
from the following: a leukemia cell line; a melanoma cell line; a lung cancer cell line; a
colon cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a breast cancer
cell line; a prostate cancer cell line; and a renal cancer cell line. In other embodiments, the
interferon-alpha homologue has at least about 2-fold higher cytotoxic activity (i.e., has an
LC50 value at least about 2-fold lower), at least about 5-fold higher cytotoxic activity, or
at least about 10-fold higher cytotoxic activity, than interferon-Con1, against a population
of cells of at least one cancer cell line selected from: a leukemia cell line; a melanoma cell
line; a lung cancer cell line; a colon cancer cell line; a CNS cancer cell line; an ovarian
cancer cell line; a breast cancer cell line; a prostate cancer cell line; and a renal cancer cell
line.

The invention includes an interferon-alpha homologue having increased
growth inhibition activity relative to human interferon-alpha 2a or to interferon-Con1. In
various embodiments, the interferon-alpha homologue has at least about 2-fold higher
growth inhibition activity (i.e., has a GI50 value at least about 2-fold lower), at least about
5-fold higher growth inhibition activity, or at least about 10-fold higher growth inhibition
activity, than human interferon-alpha 2a, against a population of cells of one or more
cancer cell lines selected from: a leukemia cell line; a melanoma cell line; a lung cancer
cell line; a colon cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a
breast cancer cell line; a prostate cancer cell line; and a renal cancer cell line. In other
embodiments, the interferon-alpha homologue has at least about 2-fold higher growth
inhibition activity (i.e., has a GI50 value at least about 2-fold lower), at least about 5-fold
higher growth inhibition activity, or at least about 10-fold higher growth inhibition
activity, than interferon-Con1, against at least one cancer cell line selected from the
following: a leukemia cell line; a melanoma cell line; a lung cancer cell line; a colon
cancer cell line; a CNS cancer cell line; an ovarian cancer cell line; a breast cancer cell
line; a prostate cancer cell line; and a renal cancer cell line.

The discovery set forth herein that interferons (such as the interferon-alpha
homologues described herein) can be evolved, modified, or recombined to display a
variety of activity profiles provides an opportunity for evolving and creating customized
and specific interferon homologues for the treatment of a variety of specific diseases or
disease conditions, including, e.g., a variety of cancers or related conditions. For example,
an interferon homologue of the invention optimized to have increased potency against a
particular target cancer cell type may also be optimized to have (advantageously) reduced
toxicity towards a non-target cell(s), and thus may produce lower side effects in the
subject to which the homologue is administered (e.g., patient).

The present invention further provides an opportunity to optimize
interferon homologues against tumor cells taken from a subpopulation of subjects (e.g.,
mammals or human patients), or even from an individual subject (e.g., mammal or human
patient), providing therapeutic or prophylactic treatment tailored to the individual subject.
Optimized interferon homologues of the invention may provide therapeutic or
prophylactic benefit against cancers or related conditions or other interferon-treatable
disorders or conditions which are otherwise unresponsive to currently-available interferons
or to other treatment regimes.

ANTIVIRAL PROPERTIES OF INTERFERON HOMOLOGUES 

The antiviral activity of interferon homologues of the present invention was
evaluated in a human WISH cell/EMCV assay as described in Example 1. Fig. 2 shows
the antiviral activity of exemplary interferon homologues of the invention comprising
amino acid sequences SEQ ID NO:36 to SEQ ID NO:54.

Improved in vitro antiviral activity of exemplary IFN-alpha homologues of
the invention has been shown to be maintained in vivo in a murine model system. Two
IFN-alpha homologues of the invention, designated CH2.2 and CH2.3 (SEQ ID NOS:84
and 85, respectively), were previously shown to have about 206,000-fold and 138,000-fold
improved antiviral activity, respectively, compared to human IFN-alpha 2a in a murine
cell-based assay, as well as significantly higher activity in the same assay as compared to
native murine interferons (Chang et al. (1999) Nature Biotechnol. 17:793-797). As
described in Example 3 below, Balb/c mice challenged with a lethal dose of vesicular
stomatitis virus (VSV) were administered varying doses of IFN-alpha homologues,
designated CH2.2 and CH2.3, native murine interferon Mu-IFN alpha 4, and human
IFN-alpha 2a. The high in vitro activity correlated well with the observed in vivo activity
(Fig. 5). The CH2.2 and CH2.3 homologues were fully effective in protecting mice from the
lethal viral challenge, while the same dosage of the native murine interferon was partially
effective and the human IFN-alpha 2a was completely ineffective. These results indicate
that compositions comprising interferon homologues of the present invention can be used
in methods to inhibit viral replication in subjects infected with viruses including, but not
limited to: human immunodeficiency virus (HIV), hepatitis C virus (HCV), herpes simplex
virus (HSV), and hepatitis B virus (HBV). Inhibition can be performed in vitro (useful,
e.g., in a variety of antiviral assays), ex vivo (useful, e.g., as a therapeutic or prophylactic
agent in ex vivo methods discussed herein), or in vivo (useful, e.g., as a therapeutic or
prophylactic agent in in vivo methods discussed herein).

INTERFERON HOMOLOGUES IN THE TREATMENT OF AUTOIMMUNE AND
OTHER IMMUNE-RELATED DISORDERS

Compositions of the present invention can be used to therapeutically or
prophylactically treat and thereby alleviate a variety of immune system-related disorders
characterized by hyper- or hypo-active immune system function or other features. Such
disorders include hyperallergenicity and autoimmune disorders, such as multiple sclerosis,
type I (insulin dependent) diabetes mellitus, lupus erythematosus, amyotrophic lateral
sclerosis, Crohn's disease, rheumatoid arthritis, stomatitis, asthma, allergies, psoriasis and
the like.

THERAPEUTIC AND PROPHYLACTIC COMPOSITIONS 
Therapeutic or prophylactic compositions comprising one or more
interferon homologue polypeptides or nucleic acids of the invention are tested in
appropriate in vitro, ex vivo, and in vivo animal models of disease, to confirm efficacy,
tissue metabolism, and to estimate dosages, according to methods well known in the art.
In particular, dosages can be determined by activity comparison of the alpha interferon
homologues to existing alpha interferon therapeutics or prophylactics, i.e., in a relevant
assay. In one aspect, the invention provides methods comprising administering one or
more interferon homologue nucleotides or polypeptides of the invention (or fragments
thereof) described above to a mammal, including, e.g., a human, primate, mouse, pig, cow,
goat, rabbit, rat, guinea pig, hamster, horse, sheep; or a non-mammalian vertebrate such as
a bird (e.g., a chicken or duck) or a fish, or invertebrate, as described in greater detail
below. Such compositions typically comprise one or more interferon homologue


[...] triphosphates and a nucleic acid polymerase, e.g., [...] the library contains at least
about 5, 10, 20, 50, [...]

[...] of pharmaceutical compositions of the present [...] prophylactic, therapeutic, and
diagnostic methods [...] to oral, intravenous, [...] subcutaneous, topical, sublingual,
vaginal, rectal [...]. [...] interferon homologue polypeptide compositions can also be
administered [...] routes and appropriate formulations are generally [...].

The interferon homologue polypeptide [...] in combination with other suitable components,
can also be made [...]. Aerosol formulations can [...] subcutaneous routes, [...] which can
contain [...] formulation isotonic with the blood [...] and aqueous and non-aqueous sterile
suspensions that [...] agents, stabilizers, and [...] nucleic acid can be presented in
unit-dose or multi-dose sealed [...] such as ampules and vials.

[...] methods of administration. [...] formulations in current [...] cells transduced with
the interferon [...] intravenously or parenterally as described above. [...] (e.g., human
patients) is routine [...] intraperitoneal administration.

The dose of interferon homologue [...] polypeptide or [...] invention administered to a
subject [...] sufficient to effect a beneficial therapeutic [...] in the subject (e.g.,
patient) over time, or to inhibit infection [...]. [...] dose will be determined by the
[...] interferon homologue employed [...] of the patient, as well as the [...] weight or
surface area of the patient [...] determined by the existence, nature, and [...] accompany
the administration of a particular vector, [...] particular patient.


In the therapeutic and prophylactic treatment methods of the invention
described herein, an effective amount of an interferon-alpha nucleic acid (e.g., DNA or
mRNA) of the invention (e.g., nucleic acid dosage) will generally be in the range of, e.g.,
from about 0.05 microgram/kilogram (kg) to about 50 mg/kg, usually about 0.005-5
mg/kg. However, as will be understood, the effective amount of the nucleic acid (e.g.,
nucleic acid dosage) and/or polypeptide (e.g., polypeptide dosage) will vary in a manner
apparent to those of ordinary skill in the art according to a number of factors, including the
activity or potency of the polypeptide, the activity or potency of any nucleic acid construct
(e.g., vector, promoter, expression system) to be administered, the disease or condition
(e.g., particular cancer) to be treated, and the subject to which or whom the nucleic acid is
delivered.

For delivery of some polypeptides, e.g., by delivering nucleic acids
encoding such polypeptides, for example, adequate levels of translation and/or expression
are achieved with a nucleic acid dosage of, e.g., about 0.005 mg/kg to about 5 mg/kg.
Dosages for other polypeptides (and nucleic acids encoding them) having a known
biological activity can be readily determined by those of skill in the art according to the
factors noted above. Dosages used for other known interferon-alphas for particular
diseases provide guidelines for determining dosage and treatment regimen for a nucleic
acid or polypeptide of the invention. An effective amount of an interferon-alpha
homologue polypeptide may be in the range of from about 1 microgram to about 1
milligram, and more typically from about 1 microgram to about 100 micrograms.
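
As a worked illustration of the per-kilogram figures above, the following sketch converts a mg/kg dosage range into a total amount for a given body weight. The function name and the 70 kg weight in the usage note are illustrative assumptions; the default bounds of 0.005 mg/kg and 5 mg/kg are the nucleic acid dosage range stated above. The sketch performs unit arithmetic only and does not account for any of the potency, construct, disease, or subject factors noted above.

```python
def total_dose_range_mg(weight_kg, low_mg_per_kg=0.005, high_mg_per_kg=5.0):
    """Scale a per-kilogram dosage range to a total amount (mg) for one subject."""
    if weight_kg <= 0:
        raise ValueError("body weight must be positive")
    if low_mg_per_kg > high_mg_per_kg:
        raise ValueError("lower bound exceeds upper bound")
    return (weight_kg * low_mg_per_kg, weight_kg * high_mg_per_kg)
```

For a 70 kg subject, the stated range corresponds to roughly 0.35 mg to 350 mg of nucleic acid in total.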

A composition for use in therapeutic and prophylactic treatment methods of
the invention described herein may comprise, e.g., a concentration of an interferon-alpha
homologue nucleic acid (e.g., DNA or mRNA) of the invention of from about 0.1
microgram/milliliter (ml) to about 20 mg/ml and a pharmaceutically acceptable carrier
(e.g., aqueous carrier).

A composition for use in therapeutic and prophylactic treatment methods of
the invention described herein may comprise, e.g., a concentration of an interferon-alpha
homologue polypeptide of the invention in an amount as described above and herein and a
pharmaceutically acceptable carrier (e.g., aqueous carrier).

In determining the effective amount of the vector, cell type, or formulation
to be administered in the treatment or prophylaxis of cancers or viral diseases, the
physician evaluates circulating plasma levels, vector/cell/formulation/interferon
homologue toxicities, progression of the disease, and the production of anti-vector/interferon
homologue antibodies.

The dose administered, e.g., to a 70 kilogram patient will be in the range
equivalent to dosages of currently-used interferon-alpha therapeutic or prophylactic
proteins, and doses of vectors or cells which produce interferon homologue sequences are
calculated to yield an equivalent amount of interferon homologue nucleic acid or
expressed protein. The vectors of this invention can supplement treatment of cancers and
virally-mediated conditions by any known conventional therapy, including cytotoxic
agents, nucleotide analogues (e.g., when used for treatment of HIV infection), biologic
response modifiers, and the like.

For administration, interferon homologues and transduced cells of the
present invention can be administered at a rate determined by the LD-50 of the interferon
homologue polypeptide or nucleic acid, vector, or transduced cell type, and the
side-effects of the interferon homologue polypeptides or nucleic acids, vector or cell type at
various concentrations, as applied to the mass and overall health of the patient.
Administration can be accomplished via single or divided doses.

For introduction of recombinant alpha-interferon nucleic acid transduced
cells into a subject (e.g., patient), blood samples are obtained prior to infusion, and saved
for analysis. Between 1 x 10^[...] and 1 x 10^[...] transduced cells are infused intravenously
over 60-200 minutes. Vital signs and oxygen saturation by pulse oximetry are closely
monitored. Blood samples are obtained 5 minutes and 1 hour following infusion and
saved for subsequent analysis. Leukopheresis, transduction and reinfusion are optionally
repeated every 2 to 3 months for a total of 4 to 6 treatments in a one year period. After the
first treatment, infusions can be performed on an outpatient basis at the discretion of the
clinician. If the reinfusion is given as an outpatient, the participant is monitored for at
least 4, and preferably 8 hours following the therapy. Transduced cells are prepared for
reinfusion according to established methods. See Abrahamsen et al. (1991) J. Clin.
Apheresis 6:48-53; Carter et al. (1988) J. Clin. Apheresis 4:113-117; Aebersold et al.
(1988) J. Immunol. Methods 112:1-7; Muul et al. (1987) J. Immunol. Methods 101:171-181;
and Carter et al. (1987) Transfusion 27:362-365. After a period of about 2-4 weeks in
culture, the cells should number between 1 x 10^[...] and 1 x 10^[...]. In this regard, the growth
characteristics of cells vary from patient to patient and from cell type to cell type. About
72 hours prior to reinfusion of the transduced cells, an aliquot is taken for analysis of
phenotype, and percentage of cells expressing the therapeutic or prophylactic agent.

If a subject (e.g., patient) undergoing infusion of a vector or transduced cell
or protein formulation develops fevers, chills, or muscle aches, he/she receives the
appropriate dose of aspirin, ibuprofen, acetaminophen or other pain/fever controlling drug.
Subjects (e.g., patients) who experience reactions to the infusion such as fever, muscle
aches, and chills are premedicated 30 minutes prior to the future infusions with either
aspirin, acetaminophen, or, e.g., diphenhydramine. Meperidine is used for more severe
chills and muscle aches that do not quickly respond to antipyretics and antihistamines.
Cell infusion is slowed or discontinued depending upon the severity of the reaction.

THERAPEUTIC AND PROPHYLACTIC TREATMENT METHODS

The present invention also includes methods of therapeutically or
prophylactically treating a disease or disorder by administering in vivo or ex vivo one or
more nucleic acids or polypeptides of the invention described above (or compositions
comprising a pharmaceutically acceptable excipient and one or more such nucleic acids or
polypeptides) to a subject, including, e.g., a mammal, including, e.g., a human, primate,
mouse, pig, cow, goat, rabbit, rat, guinea pig, hamster, horse, sheep; or a non-mammalian
vertebrate such as a bird (e.g., a chicken or duck) or a fish, or invertebrate.
In one aspect of the invention, in ex vivo methods, one or more cells or a
population of cells of interest of the subject (e.g., tumor cells, tumor tissue sample, organ
cells, blood cells, cells of the skin, lung, heart, muscle, brain, mucosae, liver, intestine,
spleen, stomach, lymphatic system, cervix, vagina, prostate, mouth, tongue, etc.) are
obtained or removed from the subject and contacted with an amount of a polypeptide of
the invention that is effective in prophylactically or therapeutically treating the disease,
disorder, or other condition. The contacted cells are then returned or delivered to the
subject to the site from which they were obtained or to another site (e.g., including those
defined above) of interest in the subject to be treated. If desired, the contacted cells may
be grafted onto a tissue, organ, or system site (including all described above) of interest in
the subject using standard and well-known grafting techniques or, e.g., delivered to the
blood or lymph system using standard delivery or transfusion techniques.
[...] comprising a target nucleic acid sequence of the invention that encodes a biologically
active polypeptide of interest (e.g., a polypeptide of the invention) that is effective in
prophylactically or therapeutically treating the disease, disorder, or other condition. The
one or more cells or population of cells is contacted with a sufficient amount of the
polynucleotide construct and a promoter controlling expression of said nucleic acid
sequence such that uptake of the polynucleotide construct (and promoter) into the cell(s)
occurs and sufficient expression of the target nucleic acid sequence of the invention results
to produce an amount of the biologically active polypeptide effective to prophylactically
or therapeutically treat the disease, disorder, or condition. The polynucleotide construct
may include a promoter sequence (e.g., CMV promoter sequence) that controls expression
of the nucleic acid sequence of the invention and/or, if desired, one or more additional
nucleotide sequences encoding at least one or more of another polypeptide of the
invention, a cytokine, adjuvant, or co-stimulatory molecule, or other polypeptide of
interest.

Following transfection, the transformed cells are returned, delivered, or
transferred to the subject to the tissue site or system from which they were obtained or to
another site (e.g., tumor cells, tumor tissue sample, organ cells, blood cells, cells of the
skin, lung, heart, muscle, brain, mucosae, liver, intestine, spleen, stomach, lymphatic
system, cervix, vagina, prostate, mouth, tongue, etc.) to be treated in the subject. If
desired, the cells may be grafted onto a tissue, skin, organ, or body system of interest in
the subject using standard and well-known grafting techniques or delivered to the blood or
lymphatic system using standard delivery or transfusion techniques. Such delivery,
administration, or transfer of transformed cells is typically made by using one or more of
the routes or modes of administration described above. Expression of the target nucleic
acid occurs naturally or can be induced (as described in greater detail below) and an
amount of the encoded polypeptide is expressed sufficient and effective to treat the disease
or condition at the site or tissue system.

In another aspect, the invention provides in vivo methods in which one or
more cells of interest or a population of cells of the subject (e.g., including those cells and
cell systems and subjects described above) are transformed in the body of the subject by
contacting the cell(s) or population of cells with (or administering or transferring to the
cell(s) or population of cells using one or more of the routes or modes of administration
[...] administered or delivered to the subject as described above in an amount effective to treat
the disease or disorder.

In another aspect, in each in vivo and ex vivo treatment method described
above, the amount of polynucleotide administered to the cell(s) or subject can be an
amount sufficient that uptake of said polynucleotide into one or more cells of the subject
occurs and sufficient expression of said nucleic acid sequence results to produce an
amount of a biologically active polypeptide effective to enhance an immune response in
the subject, including an immune response induced by an immunogen (e.g., antigen). In
another aspect, for each such method, the amount of polypeptide administered to cell(s) or
subject can be an amount sufficient to enhance an immune response in the subject,
including that induced by an immunogen (e.g., antigen).

In yet another aspect, in an in vivo or ex vivo treatment method in which a
polynucleotide construct (or composition comprising a polynucleotide construct) is used to
deliver a physiologically active polypeptide to a subject, the expression of the
polynucleotide construct can be induced by using an inducible on- and off-gene expression
system. Examples of such on- and off-gene expression systems include the Tet-On
Gene Expression System and Tet-Off Gene Expression System (see, e.g., Clontech
Catalog 2000, pg. [...] for a detailed description of each such system), respectively.
Other controllable or inducible on- and off-gene expression systems are known to those of
ordinary skill in the art. With such system, expression of the target nucleic acid of the
polynucleotide construct can be regulated in a precise, reversible, and quantitative manner.
Gene expression of the target nucleic acid can be induced, for example, after the stable
transfected cells containing the polynucleotide construct comprising the target nucleic acid
are delivered or transferred to or made to contact the tissue site, organ or system of
interest. Such systems are of particular benefit in treatment methods and formats in which
it is advantageous to delay or precisely control expression of the target nucleic acid (e.g.,
to allow time for completion of surgery and/or healing following surgery; to allow time for
the polynucleotide construct comprising the target nucleic acid to reach the site, cells,
system, or tissue to be treated; to allow time for the graft containing cells transformed with
the construct to become incorporated into the tissue or organ onto or into which it has been
spliced or attached, etc.).
Specialized alignment programs such as BLAST can also be incorporated into the systems
of the invention for alignment of nucleic acids or proteins (or corresponding character 
strings). 

Integrated systems for analysis in the present invention typically include a
digital computer with GO software for aligning sequences, as well as data sets entered into
the software system comprising any of the sequences herein. The computer can be, e.g., a
PC (Intel x86 or Pentium chip-compatible DOS™, OS2™, WINDOWS™, WINDOWS
NT™, WINDOWS95™, WINDOWS98™, LINUX based machine, a MACINTOSH™,
Power PC, or a UNIX based (e.g., SUN™ work station) machine) or other commercially
common computer which is known to one of skill. Software for aligning or otherwise
manipulating sequences is available, or can easily be constructed by one of skill using a
standard programming language such as Visualbasic, Fortran, Basic, Java, or the like.
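
By way of illustration only, software for comparing two sequence character strings might be sketched as follows. The sketch computes a Smith-Waterman-style local alignment score with a linear gap penalty; it is not BLAST and is not the software of any particular system described herein. The function name and the match, mismatch, and gap values are arbitrary assumptions chosen for the example.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between character strings a and b.

    Uses the Smith-Waterman recurrence with a linear gap penalty and keeps
    only two rows of the scoring matrix at a time.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            # Score for aligning a[i-1] with b[j-1].
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor of 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best
```

With these example parameters, two strings that share the four-character run "ACGT" and nothing else score 8 (four matches at 2 each), while strings with no characters in common score 0.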

Any controller or computer optionally includes a monitor which is often a
cathode ray tube ("CRT") display, a flat panel display (e.g., active matrix liquid crystal
display, liquid crystal display), or others. Computer circuitry is often placed in a box
which includes numerous integrated circuit chips, such as a microprocessor, memory,
interface circuits, and others. The box also optionally includes a hard disk drive, a floppy
disk drive, a high capacity removable drive such as a writeable CD-ROM, and other
common peripheral elements. Inputting devices such as a keyboard or mouse optionally
provide for input from a user and for user selection of sequences to be compared or
otherwise manipulated in the relevant computer system.

The computer typically includes appropriate software for receiving user
instructions, either in the form of user input into a set of parameter fields, e.g., in a GUI, or in
the form of preprogrammed instructions, e.g., preprogrammed for a variety of different
specific operations. The software then converts these instructions to appropriate language
for instructing the operation of the fluid direction and transport controller to carry out the
desired operation.

The software can also include output elements for controlling nucleic acid
synthesis (e.g., based upon a sequence or an alignment of sequences herein) or other
operations which occur downstream from an alignment or other operation performed using
a character string corresponding to a sequence herein.
In one embodiment, the invention provides an integrated system
comprising a computer or computer readable medium comprising a database having one or
more sequence records. Each of the sequence records comprises one or more character
strings corresponding to a nucleic acid or polypeptide or protein sequence selected from
SEQ ID NO:1 to SEQ ID NO:85. The integrated system further comprises a user input
interface allowing a user to selectively view the one or more sequence records. In one such
integrated system, the computer or computer readable medium comprises an alignment
instruction set that aligns the character strings with one or more additional character
strings corresponding to a nucleic acid or polypeptide or protein sequence.

One such integrated system includes an instruction set that comprises at 
least one of the following: a local homology comparison determination, a homology 
alignment determination, a search for similarity determination, and a BLAST 
determination. In some embodiments, the system further comprises a readable output 
element that displays an alignment produced by the alignment instruction set. In another 
embodiment, the computer or computer readable medium further comprises an instruction 
set that translates at least one nucleic acid sequence which comprises a sequence selected 
from SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78 into an amino
acid sequence. The instruction set may select the nucleic acid by applying a codon usage 
instruction set or an instruction set which determines sequence identity to a test nucleic 
acid sequence. 
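
An instruction set that translates a nucleic acid character string into an amino acid character string might, for example, be sketched as follows. The sketch assumes the standard genetic code, reads in the first frame from the first character, and stops at the first stop codon; the function name, the use of "X" for unrecognized codons, and the short test string in the usage note are assumptions made for illustration and are not sequences of the invention.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleic_acid):
    """Translate a DNA or RNA string in frame 1 up to the first stop codon."""
    seq = nucleic_acid.upper().replace("U", "T")
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

For instance, the arbitrary test string "ATGTGTGATCTGCCTTAA" is read as the codons ATG, TGT, GAT, CTG, CCT followed by the stop codon TAA, and translates to "MCDLP".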

Methods of using a computer system to present information pertaining to at 
least one of a plurality of sequence records stored in a database are also provided. Each of 
the sequence records comprises at least one character string corresponding to SEQ ID
NO:1 to SEQ ID NO:85. The method comprises determining at least one character string
corresponding to one or more of SEQ ID NO:1 to SEQ ID NO:85 or a subsequence
thereof; determining which of the at least one character string of the list are selected by a
user; and displaying each of the selected character strings, or aligning each of the selected
character strings with an additional character string. The method may further comprise
displaying an alignment of each of the selected character strings with an additional
character string and/or displaying the list. 
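
The method above (listing sequence records, determining which are selected by a user, and displaying the selected character strings) can be sketched in outline as follows. The record class, the function names, the record identifiers, and the short placeholder strings in the usage note are all hypothetical; they are not the sequences of SEQ ID NO:1 to SEQ ID NO:85, and an actual system would read records from a database rather than from an in-memory dictionary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SequenceRecord:
    record_id: str   # hypothetical identifier for the record
    kind: str        # "nucleic acid" or "polypeptide"
    sequence: str    # the character string itself


def list_records(database):
    """Return the identifiers of all records, sorted for display as a list."""
    return sorted(database)


def select_records(database, chosen_ids):
    """Return the records the user selected, skipping unknown identifiers."""
    return [database[i] for i in chosen_ids if i in database]


def display(records, width=60):
    """Format the selected records as FASTA-like text, `width` characters per line."""
    lines = []
    for rec in records:
        lines.append(f">{rec.record_id} ({rec.kind})")
        for k in range(0, len(rec.sequence), width):
            lines.append(rec.sequence[k:k + width])
    return "\n".join(lines)
```

A database built as `{"rec-1": SequenceRecord("rec-1", "nucleic acid", "ATGGCC"), "rec-2": SequenceRecord("rec-2", "polypeptide", "MA")}` could then be listed with `list_records`, narrowed with `select_records(database, ["rec-2"])`, and shown with `display`.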
KITS

In an additional aspect, the present invention provides kits embodying the 
methods, composition, systems and apparatus herein. Kits of the invention optionally 
comprise one or more of the following: (1) an apparatus, system, system component or
apparatus component as described herein; (2) instructions for practicing the methods
described herein, and/or for operating the apparatus or apparatus components herein
and/or for using the compositions herein; (3) one or more alpha interferon homologue
compositions (such as, e.g., compositions comprising at least one interferon alpha
homologue nucleic acid or polypeptide or fragment thereof, cell, vector, etc., of the
invention) or components (interferon alpha homologue nucleic acid or polypeptide or
fragment thereof, cell, vector, etc., of the invention); (4) a container for holding one or
more aspects of the invention, including such components or compositions, and (5)
packaging materials. 

In a further aspect, the present invention provides for the use of any
apparatus, apparatus component, composition or kit herein, for the practice of any method
or assay herein, and/or for the use of any apparatus or kit to practice any assay or method
herein.

EXAMPLES 

EXAMPLE 1: PREPARATION AND SCREENING OF SHUFFLED INTERFERON-ALPHA
LIBRARIES

Fragments (25-60 base pairs (bp) in length) of about 20 human interferon-alpha
subspecies genes were prepared by PCR amplification and DNAse treatment, and
recombined essentially as described in Crameri, A. et al. (1998) Nature 391:288-291, to
produce shuffled interferon-alpha mature coding sequences. Expression libraries were
prepared by subcloning shuffled interferon-alpha mature coding sequences into an E. coli
secretion vector. Shuffled interferon polypeptides were expressed as mature proteins
fused at the C-termini to an E tag (Amersham-Pharmacia) to facilitate quantitation and
purification from the periplasmic space. E. coli transformants were picked using a robotic
colony picker (Q-Bot, Genetix Pharmaceuticals) into microtiter plates, and periplasmic
extracts were prepared.

Periplasmic extracts were assayed for antiproliferative activity on a human 
Daudi cell line as described by Scarozza, A.M. et al. (1992) J. Interferon Res. 12:35-42.
Clones exhibiting antiproliferative activity in the Daudi assay were
re-screened and expression levels determined by Western blot using an anti-E tag antibody
(Amersham-Pharmacia). Clones exhibiting highest activity normalized to expression
levels were selected for sequencing and were also utilized as substrates for additional
rounds of shuffling and screening as described above.
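
The selection step described above, in which clones are ranked by activity normalized to expression level, can be sketched as follows. The function name, the data layout, the `top_n` cutoff, and the clone names and values in the usage note are hypothetical; the source does not state how many clones were carried forward or in what units activity and expression were recorded, so the sketch assumes only that each is a positive number on a consistent scale.

```python
def rank_by_normalized_activity(clones, top_n=2):
    """Rank clones by activity divided by expression level.

    `clones` maps a clone name to an (activity, expression) pair.
    Clones with no detectable expression are skipped because their
    activity cannot be normalized.
    """
    scored = []
    for name, (activity, expression) in clones.items():
        if expression <= 0:
            continue
        scored.append((activity / expression, name))
    scored.sort(reverse=True)  # highest normalized activity first
    return [name for _, name in scored[:top_n]]
```

With hypothetical values `{"A": (100.0, 10.0), "B": (90.0, 3.0), "C": (50.0, 0.0), "D": (40.0, 2.0)}`, clone B (ratio 30) and clone D (ratio 20) rank ahead of clone A (ratio 10), even though A has the highest raw activity, and clone C is skipped.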

Clones from the first and second rounds of shuffling having relatively high
antiproliferative activity by the Daudi assay were subcloned into a CHO expression vector
(pDEI-1011) in which the E-tag/6-His tag (Amersham-Pharmacia) is fused to the
C-terminus of the shuffled interferons. Clones were transfected into CHO cells and stable
cell lines were selected with 1 mg/ml G418. CHO-expressed mature interferons were
purified on anti-E tag Sepharose column (Amersham-Pharmacia) and quantitated by a
Bradford assay (Biorad). CHO-purified shuffled interferons were assayed for
antiproliferative activity by the Daudi assay and for antiviral activity using a human WISH
cell/EMCV assay as described below.

Human WISH cell/EMCV antiviral assay

WISH cells were seeded to a density of 6 x 10^[...] cells/well in 96-well plates
in 100 µl RPMI medium (Gibco-BRL) supplemented with 10% fetal calf serum, penicillin
(100 µg/ml), and streptomycin (100 µg/ml), and incubated for 24 hours at 37°C. Samples
of interferon-alpha polypeptides in medium (100 µl total volume) were added to wells and
incubated for 3 hours at 37°C under a 5% CO2 atmosphere. Dilutions of EMCV
(encephalomyocarditis virus) were added to wells in 50 µl volumes, and incubated for 24
hours as above. Medium was carefully removed and wells were rinsed 2x with warm
phosphate-buffered saline (PBS). Neutral red (100 µl/well of 1:50 dilution in medium)
was added to the wells and incubated for 2 hours as above. Glutaraldehyde (50 µl/well of
0.5% in PBS) was added and incubated for 30 minutes as above. Wells were washed 2x in
PBS, and 100 µl/well of a solution of 50% methanol, 1% acetic acid was added.
Absorbance at 540 nanometers (nm) was measured using a microplate reader.
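
The assay above ends with an absorbance reading at 540 nm; how that reading was converted into an activity value is not described here. One common way to summarize such dye-uptake data, shown purely as an assumption, is to express each well as a percentage of protection between a virus-only control and an uninfected cell control. The function name, both control wells, and the clamping to the 0 to 100 range are hypothetical choices for the sketch.

```python
def percent_protection(a_sample, a_cell_control, a_virus_control):
    """Percent protection from virus-induced cell death, from A540 readings.

    Assumes (hypothetically) an uninfected cell control representing 100%
    protection and a virus-only control representing 0% protection.
    """
    span = a_cell_control - a_virus_control
    if span <= 0:
        raise ValueError("cell control must read higher than virus control")
    pct = 100.0 * (a_sample - a_virus_control) / span
    # Clamp readings that fall outside the control window.
    return max(0.0, min(100.0, pct))
```

Under these assumptions, a well reading midway between the two controls would be reported as about 50% protection.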

Fig. 2 shows the antiproliferative activity and the antiviral activity of exemplary
interferon homologues of the invention, in comparison with interferon alpha-2a and
interferon-alpha Con1. The graph shows the number of Units activity per milligram of
homologue (Y axis) for a set of exemplary interferon alpha homologues, each of which is
designated with a "name" on the X axis.
EXAMPLE 2: IN VITRO CANCER CELL LINE SCREEN

An in vitro cell line screen (as described in, e.g., Monks, A. et al. (1991) J. Natl.
Cancer Inst. 83:757-766 (hereinafter "Monks") and
http://dtp.nci.gov./branches/btb/ivclsp.html, each of which is incorporated herein by
reference in its entirety for all purposes) was used to assay interferon-alpha homologues of
the invention for selective growth inhibition and/or cell killing of particular cancer cell
lines. The 60 human cancer cell lines used (Table 3) include leukemias, melanomas, and
cancers of the lung, colon, brain, ovary, breast, prostate, central nervous system, renal
system, and kidney. Human tumor cell lines were grown according to procedures outlined
in Monks and http://dtp.nci.gov./branches/btb/ivclsp.html.



Table 3
Human cancer cell lines screened

Cancer type       Cell lines
---------------   ------------------------------------------------------------
Leukemia          CCRF-CEM, HL-60 (TB), K-562, MOLT-4, RPMI-8226, SR
Colon cancer      COLO 205, HCC-2998, HCT-15, HCT-116, HT29, KM12, SW-620
CNS cancer        SF-268, SF-295, SF-539, SNB-19, SNB-75, U251
Lung cancer       A549/ATCC, EKVX, HOP-62, HOP-92, NCI-H23, NCI-H226,
                  NCI-H322M, NCI-H460, NCI-H522
Breast cancer     MCF-7, NCI/ADR, HS578T, MDA-MB-231/ATCC, MDA-MB-435,
                  MDA-N, BT-549, T-47D
Melanoma          LOX IMVI, M14, MALME-3M, SK-MEL-2, SK-MEL-5, SK-MEL-28,
                  UACC-62, UACC-257
Ovarian cancer    IGROV1, OVCAR-3, OVCAR-4, OVCAR-5, OVCAR-8, SK-OV-3
Prostate cancer   DU-145, PC-3
Renal cancer      786-0, A498, ACHN, CAKI-1, RXF 393, SN12C, TK-10, UO-31



Briefly, cells were inoculated into 96 well microtiter plates at densities ranging from about 5,000 to about 40,000 cells/well, depending on the growth properties of the particular cell line. After inoculation, the microtiter plates were incubated for 24 hours (h) at 37 degrees C prior to addition of test samples (e.g., interferon homologues of the invention or control interferons). After 24 h, two plates of each cell line were fixed in situ with trichloroacetic acid (TCA), to provide a measurement of the cell population for each cell line at the time of test sample addition (T0). To the remaining plates, interferon samples (affinity-purified from CHO cell supernatants) were added in five 10-fold serial dilutions ranging from 10^[illegible] to 10^[illegible] µg/ml. Following sample addition, the plates were incubated for an additional 6 days. The assay was terminated by addition of TCA.

Cell population was determined by measuring cellular protein in a quantitative protein dye-binding assay. Sulforhodamine B solution (100 µl) at 0.4% (w/v) in 1% acetic acid was added to each well, followed by incubation for 10 minutes at room temperature. Unbound dye was removed by washing five times with 1% acetic acid and the plates were air-dried. Protein-bound dye was solubilized with 10 millimolar (mM) Tris, and the absorbance was read at 515 nanometers (nm) on an automated plate reader.

Seven absorbance measurements were taken for each dose-response assay, corresponding to: the amount of cellular protein prior to sample addition (time zero; T0), the amount of cellular protein at the end of the incubation period in the absence of test sample (control growth, C), and five measurements corresponding to the amount of cellular protein at the end of the incubation period in the presence of each of the five concentrations of interferon test sample (test growth in presence of interferon test sample at the five concentration levels, Ti). These measurements were used to calculate the following three parameters for each test sample:

GI50, or "growth inhibition of 50%," is the concentration of interferon test sample at which cell growth is inhibited by 50%, as measured by a 50% reduction in the net protein/polypeptide increase in the interferon test sample as compared to that observed in the control cells (no test sample) at the end of the incubation period. GI50 is calculated as the concentration of test sample where [(Ti-T0)/(C-T0)] x 100 = 50. See Fig. 3A.

TGI, or "total growth inhibition," is the concentration of interferon test sample at which cell growth is totally inhibited, wherein the amount of cellular protein at the end of the incubation period equals the amount of cellular protein at the beginning of the incubation period. The concentration of interferon test sample that produces total growth inhibition (TGI) is calculated as the concentration of test sample where Ti = T0.

LC50 is the concentration of interferon test sample at which a 50% reduction in the measured amount of cellular protein at the end of the incubation as compared to that at the beginning of the incubation period is observed, indicating a net loss of cells following interferon test sample addition. LC50 is calculated as the concentration of test sample where [(Ti-T0)/T0] x 100 = -50.

If, for a particular test sample, an effect was not achieved or was exceeded at the concentration range tested, the value for that parameter was expressed as greater or less than the maximum or minimum concentration tested.
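
By way of illustration only, the three parameters defined above might be computed from the seven absorbance readings as in the following Python sketch. The function names, the absorbance values, and the concentration series are hypothetical. Two further assumptions are not stated in the text: the growth formula is switched from [(Ti-T0)/(C-T0)] x 100 to [(Ti-T0)/T0] x 100 when Ti falls below T0, and the concentration at which a target level is crossed is estimated by linear interpolation on a log10 concentration scale.

```python
import math

def percent_growth(t0, c, ti):
    """Percent growth for one test concentration.

    Uses [(Ti-T0)/(C-T0)] x 100 while Ti >= T0 (net growth) and
    [(Ti-T0)/T0] x 100 when Ti < T0 (net cell loss). The switch between
    the two formulas is an assumption of this sketch.
    """
    if ti >= t0:
        return (ti - t0) / (c - t0) * 100.0
    return (ti - t0) / t0 * 100.0

def crossing_concentration(concs, growth, level):
    """Concentration at which percent growth crosses `level`.

    Interpolates linearly in log10(concentration) between the two
    neighbouring doses that bracket the level. Returns None when the level
    is not reached in the tested range, in which case the parameter would
    be reported as greater or less than the extreme concentration tested.
    """
    for i in range(len(concs) - 1):
        g_lo, g_hi = growth[i], growth[i + 1]
        if (g_lo - level) * (g_hi - level) <= 0 and g_lo != g_hi:
            frac = (g_lo - level) / (g_lo - g_hi)
            log_lo, log_hi = math.log10(concs[i]), math.log10(concs[i + 1])
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    return None

# Hypothetical readings: time-zero, control, and five test wells.
t0, c = 0.2, 1.0
concs = [0.001, 0.01, 0.1, 1.0, 10.0]   # hypothetical 10-fold dilution series
ti = [1.0, 0.8, 0.4, 0.2, 0.05]

growth = [percent_growth(t0, c, t) for t in ti]
gi50 = crossing_concentration(concs, growth, 50.0)    # 50% growth inhibition
tgi = crossing_concentration(concs, growth, 0.0)      # Ti = T0
lc50 = crossing_concentration(concs, growth, -50.0)   # 50% net cell loss
print(gi50, tgi, lc50)
```

A return value of None corresponds to the case in which the effect was not achieved or was exceeded within the concentration range tested.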



EXAMPLE 3: IN VITRO ACTIVITY OF IFN-ALPHA HOMOLOGUES CORRELATES WITH IN VIVO EFFICACY

Fragments of human interferon-alpha genes were shuffled and screened for activity in a murine cell-based antiviral assay as described by Chang et al. (1999) Nature Biotechnol. 17:793-797. Interferon-alpha homologues that exhibited over 10^[illegible]-fold higher antiviral activity than human interferon-alpha 2a against mouse cells were isolated. The antiviral activities of a number of the interferon-alpha homologues significantly exceeded even the antiviral activity of native mouse interferons, including Mu-IFN-alpha 4 (Chang et al., supra). Recursive sequence recombination (e.g., DNA shuffling) of human interferon-alpha gene fragments to produce novel interferon alpha homologues, and subsequent screening of such homologues against murine interferon receptors, resulted in the identification and isolation of interferon-alpha homologues with activity optimized for the distantly related murine species.

A dose-response study in mice was performed to determine if the high antiviral activity observed in vitro is sustained in vivo. Two of the mouse-optimized interferon-alpha homologues, designated herein as CH2.2 and CH2.3 (SEQ ID NOS:84 and 85, respectively), were used in this study. CH2.2 and CH2.3 were shown to have about 138,000-fold and about 206,000-fold higher activity, respectively, than human interferon-alpha 2a, and about 2.5-fold and about 1.6-fold higher activity than native mouse interferon-alpha 4, in the in vitro mouse cell antiviral assay (Chang et al., supra).
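
For illustration, the two homologues used in this study can be compared position by position. The Python sketch below is not part of the described method. The two strings are transcriptions of SEQ ID NO:84 and SEQ ID NO:85 from the sequence listing, with the region following the conserved SPCAW motif read as EVV in agreement with the corresponding nucleotide sequences (...TGGGAGGTTGTC), and the helper name `substitutions` is hypothetical.

```python
# Amino acid sequences transcribed from the sequence listing
# (SEQ ID NO:84 = CH2.2, SEQ ID NO:85 = CH2.3); treat as illustrative.
CH2_2 = (
    "CDLPQTHSLGNRRALILLAQMGRISPFSCLMDRHDFGFPQEEFDDNQF"
    "QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL"
    "EACMMQEVGVEETPLMNVDSILTVKKYFRRITLYLTEKKYSPCAWEVV"
    "RAEIMRSFSFSTNLQKRLRRKE"
)
CH2_3 = (
    "CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF"
    "QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL"
    "EACMMQEVGVEETPLMNEDSILAVKKYFRRITLYLTEKKYSPCAWEVV"
    "RAEIMRSFSFSTNLQKRLRRKE"
)

def substitutions(a, b):
    """List (position, residue_in_a, residue_in_b) for each mismatch, 1-based."""
    if len(a) != len(b):
        raise ValueError("positional comparison needs sequences of equal length")
    return [(i, x, y) for i, (x, y) in enumerate(zip(a, b), start=1) if x != y]

diffs = substitutions(CH2_2, CH2_3)
identity = 100.0 * (len(CH2_2) - len(diffs)) / len(CH2_2)
for pos, x, y in diffs:
    print(f"{x}{pos}{y}")   # e.g. residue in CH2.2, position, residue in CH2.3
print(f"{len(diffs)} differences over {len(CH2_2)} residues ({identity:.1f}% identity)")
```

A plain positional comparison is sufficient here because both sequences have the same length; sequences of unequal length would require an alignment step first.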

Groups of Balb/c mice received either phosphate buffered saline (PBS), interferon-alpha homologue CH2.2, interferon-alpha homologue CH2.3, murine IFN-alpha 4, or human interferon-alpha 2a, in daily subcutaneous doses of 2, 10, or 50 µg (total volume of 50 µl) for four consecutive days. On day 2, the mice were exposed to a lethal intranasal dose (ten times the LC50) of vesicular stomatitis virus (VSV). Data are expressed as the number of mice which survive to day 21.

Fig. 5 shows that both of the mouse-optimized interferon-alpha homologues, CH2.2 and CH2.3, were as effective or more effective than native murine interferon Mu-IFN-alpha 4 in protecting mice from VSV. At the concentrations tested, human IFN-alpha 2a was nearly completely ineffective in protecting mice from the virus. Thus, the in vivo efficacy of the interferon-alpha homologues of the invention correlates remarkably well with the antiviral activities observed in the in vitro assays.

While the foregoing invention has been described in some detail for purposes of clarity and understanding, it will be clear to one skilled in the art from a reading of this disclosure that various changes in form and detail can be made without departing from the true scope of the invention. For example, all the techniques, methods, compositions, apparatus and systems described above may be used in various combinations. All publications, patents, patent applications, or other documents cited in this application are incorporated herein by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application, or other document were individually indicated to be incorporated by reference for all purposes.



SEQUENCES 

SEQ ID        Clone ID        Sequence

SEQ ID NO: 1 


2DH12 


TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGACC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTAGGGGTGAAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:2


2CA3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTGACAGGAGGGCCATGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACT^GGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAGCTGAATGAACTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGGAGAGACTCCCCTGATG 
AATGGGGACTCCATCCTGGCTGTGAAGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA
AGATTAAGGAGGAAGGAA 


SEQ ID NO:3


4AB9 


TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCGGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
PTrrTAnAAAAA TTTTP CAC TG AAC TTT AC C AG C AAC TG AATG AACTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:4 


2DA4 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATAGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGATGAGACC 
rrnr-orp ?v r» A AAA ATTTTPP ZVPTr^ A APTPTAPP Afif AnPTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:5


3DA11 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGGTA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGATGAGACC 

GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:6


2DBH 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGATGAGACC 
r^rnrT'rnf.r'T^T^ A A 7\ TTTT^TPP A PTP A A PT»TT A PP Ar^PAHPTtt AATG ACTTG 
GAAGCCTGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:7


2CA5 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCGGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG 
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CTCCTGGCACAAATGGGAAGAATCTCTCCTT^^^ 

AGACATGACTTTGGATTCC^^^^^ 

CAGAAGGCTCAAGCCATCTCTGTCCTCCA 

TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTT^^ 

ScCTAGAAAAATTl^CCACTGAACT^^^^ 
GAAGCCTGCGTGATACAGGAGGTTGGGGTCGAAG^^^ 

AATGTGGACCCCATCCTGGCTGTGAAGAAATAC^^^^ 
AGATTAAGGA55^^2^^ 

^m^mr.<-rr/-!(-r'Tr;A_AGGAC 



t SEQIDNO-.IO 



CAGAAGGCTCAAGCCATCTCTOTC ^^^^ ^^^^ 



2G5 



SEQIDN0:11 




2BA8 



rc 1 AA 

AGAGCAGAAATCATGAGATCTTTCTCr 

"CTTGGTAACAGGAGGGCCCTGATA 



RGMATGATTTCGGRTTCCCrcMGAGGAGTTOA^^^^^ 
' CAGAAGGCKAAGCCATCTCTCTCCTCCATOAG^^ 
I TTCAATCTCTTCAGCACAAAGGATraTC^^^ 
I CTCCTAGAAAAATTWCCACTC«CTTTAa^ 

^AGG"" 
SEQ ID NO:12


1F3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGGACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAACCTCTTCAGCACAAAGGACTCATCTGTTGCTTGGGATGAGAGG 
CTTCTAGACAAACTCTATACTGAACTTTACCAGCAGCTGAATGACCTG 
GAAGCCTGTGTGATGCAGGAGGTGTGGGTGGGAGGGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAGAAAATACTTCCAAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:13


4BE10 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAGATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATAATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGAACTG 
GAAGCATGTGTGATACAGGGGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCTTGGCTGTGAGGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:14


2DD9 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCCCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGGACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 

AGATTAAGGAGGAAGGAA 


SEQ ID NO:15


3CA1 


TGTGATCTGCCTCAGACCCACAGCCTTGGCAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTACCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCATGTGTGATACAGGAGGTTGGGATGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:16


2F8 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGAACTG
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:17


6CG3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAAGAGGGCCATGATG 
CTCCTGGCACAAATGGGAAGAACCTCTCCTTTCTCCTGTCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAGGGCTCAAGCCATCTTTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATTTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAAGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA


SEQ ID NO:18


3CG7 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGTAGGGCCTTGATG 
CTCCTGGCACAAATGGGAAGAATCTCCCCTTTCTCCTGCCTGAAGGAC 
AGACATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGCCTTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAAC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCATGTGTGATACAGGAGGTTGGGATGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:19


1D3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGACATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCCACCAGTTC 
CAGAAGACTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGACCTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGATGGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:20 


2G4 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCATGATG 
CTCCTGGCACAAATGAGCAGAATCTCTCCTTCCTCCTGTCTGATGGAC 
AGACATGACTTTGAATTTCCCCAGGAGGAATTTGATGATAAACAGTTC 
CAGAAGGCTCCAGCCATCTCTGTCCTCCATGAGGTGATTCAGCAGACC 
TTCAATCTCTTCAGCACAGAGGACTCATCTGCTGCTTGGGAACAGACC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGACCTG 
GAAGCATGTGTGATGCAGGAGGAGAGGGTGGGAGAAACTCCCCTGATG 
AATGCGGACTCCATCTTGGCTGTGAGGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAAAGAAGAAGTATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA


SEQ ID NO:21


1A1


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGTGTTTGATGGCAACCAGTTC 
CAGAAGGCCCAAGCCATCTCTGCCTTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAGAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTCACCAGCAACTGAATGACCTG 
GAAGCCTGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAGGAAATACTTTCAAAGAATCACT 
CTTTATCTAATGGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:22 


1D10


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 
AGACATGATTTCGGATTCCCCCAGGAGGAGTTTGATGGCCACCAGTTC 
CAGAAGACTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATGACCTG 
GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGATGGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:23


1F6 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG 
ATAATGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTTCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGCTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:24 


2A10 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 

CTCCTGGCACAAATGGGAAGAATCTCTCATTTCTCCTGCCTGAAGGAC 

AGATATGATTTCGGATTCCCCCAGGAGGTGTTTGATGGCAACCAGTTC 

CAGAAGGCTCAAGCCATCTCTGCCTTCCATGAGATGATCCAGCAGACC 

TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 

CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 

GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 

AATGAGGACTCCATCCTGGCTGTGAGGAAATACTTTCAAAGAATCACT 

CTTTATCTGATGGAG AAGAAATACAGCCCT 1 1 (jCL. 1 vjVjVaAovci i i i 

AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 

AGATTAAGGAGGAAGGAA 


SEQ ID NO:25


2C3 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 

CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTTCCTCAGGAGGAGTTTGATGGCAACCAGTCC 



WO 01/25438        PCT/US00/27781

CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGATACTTGGGATGCGACC 
CTTTTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:26


201 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAAGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCGGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTCTACCAGCAGCTGAATGACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG 
AATGAGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:27


2Dia 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAGTCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGCCTTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCGAAGAATCACT 
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:28 


2D7 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGCGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGTCTGAAGGAC
AGACATGACTTCAGATTTCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTTACCAGCAACTGAATAACCTG 
GAAGCTTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCTATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAGGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:29 



2D9 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACT 
TTCAATCTCTTCAGCACT^GGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATGACCTG
GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGGTG 
AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:30


2DA2 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGCCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACAGGACTTCGGATTCCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGAACAGAGC 

GAAGCATGTGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 

AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
CTTTATCTAATAGAGAGGAAATACAGCCCTTGTGCATGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:31


2DH9 


TGTGATCTGCCTCAGACCCACAGCCCTGGTAACAGGAGGGCCTTGATG 
CTCCTGGCACAAATGGGACGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGGGGAGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGGATTCATCTGCTGCTTGGGAACAGAGC 

GAAGCCTGTGTGATACAGGAGGTTGGGGTGGAAGAGACCCCCCTGATG
AATGTGGACTCCATCCTGGCTGTGAGGAAGTACTTCCAAAGAATCACT 
CTTTATCTGACAGAGAAGAAGCATAGCCCTTGTTCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:32 


2G11 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGACTTCCCCAGGAGGAGTTTGATGGCAACCAGTTC 
CAGAAGACTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 

TTCAATCTCTTCAGCACAAAGGACTCATCTGATACTTGGGAACAGAGC 

L 1 LL 1 AVj/i/i/\/i/\ 1 X L 1 /\Ln. X X Ls/iAL X X X X LL AoL AoL X X L7/iLL X \j 

GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 
AATGTGGACTCCATCCTGGCTGTGAGAAAATACTTCCAAAGAATCACT 
CTTTATCTGACAGAGGAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:33 



2G12 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG 
CTCATGGCACAAATGAGGAGAATCTCTCCTTTCCCCCGCCTGAAGGAC 
AGATATGATTTCGGATTCCCCCAGGAGGTGTTTGATGGCAACCAGTTC 
CAGAAGGCTCAAGCTATCTTCCTTTTCCATGAGATGATGCAGCAGACC 
TTCAATCTCTTCAGCACAAAGAACTCATCTGCTGCTTGGGATGAGACC 
CTCCTAGACAAATTCTACACTGAACTCTACCAGCAGCTGAATGACTTG 
GAAGCCTGTGTGATGCAGGAGGGGAGGGTGGGAGAAACTCCCCTGATG 

AATr:Pr:fiAPTPPATPTTnr;PTr:Tf;AAnAAATAPTTCCGAAGAATCACT 

CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGCTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 
SEQ ID NO:34        2H9



SEQ ID NO:35        6BC11



CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGA^^^ 
i j.r-rrTrTrTGACACAGGAGGTTGGGGTGGAAGAGACTCCCCTbA i ^ 

a^J^aSSatcSggctgtgaagaaatact^^ 

S™??tSSSS^gaaatacagccctt^^^^ 
aSgIStcatgagatctttctctttttcaacaaacti^caaaaa 

ctggcacaaatgggaag 
agatatgatttcggattccccca 



cagSScItctctgtcctcctoagatcatccagcagacc 

TTCAATCTCTTCAGCACAAAGGATTCATr'" 



■ctgctgcttgggaacagagc 

^SScCTOGCTGTGAAGAAATACrrCCAAAGAATCACT 



CTTTATCTCACAGAGAGGAAATRCAGCCCTTGTGCCTGGGAGGTTGTC 
SSSIS^ATGAGATCTTTC'KOTrTCAACAAACTTGCAAAAA 

EACVIQEVGVKETPLMNVDSILAVRKYFQRITLYLIERKYSPCAWbVV 
p a TTTMPgPSFSTNLOKRLRRKE 



SEQ ID NO:38 



SEQ ID NO:39 



p7.TrTMBc;FSFSTNLQKRLRRKE 



SEQ ID NO:40 



SEQ ID N0:41 



SEQ ID NO:42 
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SsFsL'^^n:?^f°''^'^''''''''""''^^''"^""^'VSPCAWEw| 

KA£.i.jyLKSFSFSTNLQKRLRRKE 


SEQ m N0:4: 


J 2G6 


KAEIMRSFSFSTNLQKRLRRKE 


SEQ ID N0:4^ 


t 3AH7 


^^.IMRSFSFSTNLQKRLRRKE ^'^vvilvv 


1 SEQ ID NO:45 


2G5 


CDL^WTHSLGNRRALMLLAQMGRISPFSCLKDRQDFGFPOEEFDGNQF 

QkaqaisvlhemiqqtfnlfstkdssaaweqsllekfstIl^^^^ 

KAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:46 


2BA8 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPOEEFDrMnp 

QkaqaisvlhemiqotfnlfstkdssaaweqslleSste^^^^ 

EACVIQEVGVEETPL^l^m)SILAVRKYFQR^TLYLIER^YSP?AW^ 
RAEIMRSFSFSTNLQKRLRRKE :^i^xt.KKYSPCAWEW 


SEQ ID NO:47 
SEQ ID NO:48 
SEQ ID NO:49 1 


IF3 
4BE10 

2DD9 


^^^^^^^^^ 

CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPOEEFDGNnFl 

QkaqaisvlheimqqtfnlfstknssaawdetllekStel^^^ 

RAEIMRSFSFSTNLQKRLRRKE ^iot-«_awiiVV 

cdlpqthslgnrralmllaqmgrispfsclkdrydfgfpoeefdgnqp • 
Qkaqaisvlhemiqqtfnlfstkdssaaweqsllek%?g™^^ 

RAEIMRSFSFSTNLQKRLRRKE v,oiivcvv 


SEQ ID NO;50 


3CA1 




SEQ ID N0:51 


2F8 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPOEEFDrNnF 

QkaqaisvlhemmqqtfnlfstknssaawdetllSSS 

RAEIMRSFSFSTNLQKRLRRKE oiruc:»wiLvv 

^PTMPci^?!f''^'^'''^'=''^^^^^^YPQRITLYLTEKKYSpSwEW 
^EIMRSFSFSTNLQKRLRRKE oxrv-^wcvv 


SEQ ID NO:52 
rSEQIDNO:53 ■ 


5CG3 ■ 

1 
] 




tCG7 c 

C 
E 

F 


-DLPQTHSLGNSRALMLLAQMGRISPFSCLKDRHDFGFPbEEFDrNOF 
?KAQAISAFHEMIQQTFNLFSTKDSSAAWESjLLFK?QTpfvnn^^^^ 


SEQ ID NO;54 1 


D3 c 
0 
SEQ ID NO:55



SEQ ID NO:56 



2G4 



lAl 



SEQ ID NO:57 



SEQ ID NO:58 



SEQIDNO:59 



SEQ ID NO:60 



IDIO 



2A10 



qSpaisvlheviqqtfnlfstedssaaweqtllekf^^^^ 

EACVMQEERVGETPLMNADSILAVRKYFQRITLYLTKKKYSPCSWEV 



EACV 
RAE 



RAEIMRSFSFSTN LO KRT.RRKE ^ ^ 



2C3 



SEQ ID NO:61



SEQ ID NO:62



SEQ ID NO:63



SEQ ID NO:64



SEQ ID NO:65



SEQ ID NO:66



e*cvmSSS;i>si"vvkkvfqritlvltekkvspcawew 



QKAQA 




SEQ ID NO:67
SEQ ID NO:68


2G11
2G12


EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLTEKKHSPCSWEVV
RAEIMRSFSFSTNLQKRLRRKE

CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGLPQEEFDGNQF
QKTQAISVLHEMIQQTFNLFSTKDSSDTWEQSLLEKFYIELFQQLNDL
EACVIQEVGVEETPLMNVDSILAVRKYFQRITLYLTEEKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE

CDLPQTHSLGNRRTLMLMAQMRRISPFPRLKDRYDFGFPQEVFDGNQF
QKAQAIFLFHEMMQQTFNLFSTKNSSAAWDETLLDKFYTELYQQLNDL
EACVMQEGRVGETPLMNADSILAVKKYFRRITLYLTEKKYSPCAWEAV
RAEIMRSFSFSTNLQKRLRRKE

CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWEQSLLEKFSTELNQQLNDL
EACVTQEVGVEETPLMNEDSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:69


2H9


SEQ ID NO:70


6BC11


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRYDFGFPQEEFDGNQL
QKAQAISVLHEMIQQTFNLFSTKDSSAAWEQSLLEKFSTELNQQLNDL
EACVIQEVGVEETPLMNVDSILAVKKYFQRITLYLTERKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:71


H9bb


CDLPQTHSLGXXRAXXLLXQMXRXSXFSCLKDRXDFGXPXEEFDXXXF
QXXQAIXXXHEXXQQTFNXFSTKXSSXXWXXXLLXKXXTXLXQQLNXL
EACVXQXVXXXXTPLMNXDXILAVXKYXQRITLYLXEXKYSPCXWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:72 


CHl.l 


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 

CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGTCTGATGGAC 

AGACATGACTTTGGATTTCCCCAGGAGGAGTTTGATGACAACCAGTTC 

CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAACAGACC 

TTCAATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGATGAGACA 

CTTCTAGACAAATTCTACACTGAACTTTACCAGCAGCTGAATGACCTG 

GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 

AATGAGGACTCCATCTTGGCTGTGAAGAAATACTTCCGAAGAATCACT 

CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 

AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQ ID NO:73 


CH1.2


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGGCCTTGATA 
CTCCTGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGATCGCAACCAGTTC 
CAGAAGGCTCAAGGCATCTCTGTCCTCCATGAGATGATCCAGCAGACC 
TTCCATCTCTTCAGCACAAAGGACTCATCTGCTACTTGGGAACAGAGC 
CTCCTAGAAAAATTTTCCACTGAACTTAACCAGCAGCTGAATTArrTr 

GAAGCCTGCGTGATACAGGAGGTTGGGGTGGAAGAGACTCCCCTGATG 

AATGTGGACTCCATCCTGGCTGTGAAGAAATACTTCCGAAGAATCACT 

CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC
AGAGCAGAAATCATGAGATCTTTCTCTTTTTCAACAAACTTGCAAAAA
AGATTAAGGAGGAAGGAA


SEQ ID NO:74


CH1.3


TGTGATCTGCCTCAGACCCACAGCCTTGGTAACAGGAGGACTTTGATG
ATAATGGCACAAATGGGAAGAATCTCTCCTTTCTCCTGCCTGAAGGAC
AGACATGACTTTGGATTTCCTCAGGAGGAGTTTGATGGCAACCAGTTC
CAGAAGGCTCAAGCCATCTCTGTCCTCCATGAGATGATCCAGCAGACC

Sc^?ctStcagcacaaaggactcatctgctacttgggatgaga^ 
cSSIgacaaattctacactgaactttaccagcagctgaatgacctg 

S?S^ATGATGCAGGAGGrrGGAGTGGAAGACA^^^ 

aatgtggactctatcctgactgtgagaaaatactttcgaagaa^^^ 

CTTTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 

aSIgaLtcatgagatctttctctttttcaacaaacttgcaaaaa 
agattaaggaggaaggaa 



SEQIDNO:75 CHi.4 



TCTTGGCTGTGAAGAAATACTTCCAAAGAATCACT 
' GAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 



SEQ ID NO:76 



TGTGATCTGCCTCAGACCCACAGCC TGGGTAATAGGAGGGCCTTGATA 

ScSSScS^TGGGAAGAATCTCTCCTTTCT^^^^^^^ 
AGACATGACTTTGGATTCCCCCAGGAGGAGTTTGGTGGCAACCAGTTC 

CAG^GGCTCAAGCCATCTCTGTCCTCCATGAGATC^^^ 

ttc^tcS^^aSacagaggactcatctgctgcttggg^^^^^ 

™gIcaaattctacattgaact™^^ 
gaagcctgtgtgatgcaggaggagagggtgggagaaactcccctgatg 

AATGCGGACTCCA' 

SIgSgIS™^^^^^^^ 

SSScaaatgggaagaatctctcctttctcctgcctgaagga^ 
agSSctttggatttcctcaggaggagtttgatggcaaccact^^ 
SgSSLgccatctctgtcctccatgagatgatccagcagacc 
™^tctcttcagcacaaaggactcatctgctacttgggat^^^^ 
Sacaaattctacactgaactt^^^^^^^ 



gaagcctgtatgatacaggaggtti 



ggggtggaagagactcccctgatg 



AATGAGGACTCCATCT' 

ctctatctgacaga 



tggctgtgaagaaatacttccgaagaatcact 
gaagaaatacagcccttgtgcctgggaggttgtc 



^GA^CAG^T^A^GAis^TC^^^^ 



^nATTAAGGAGGAAGGAA 



AGAcI?ScT;?Sii^TTCCCC^^^^^ 

???Sggctcaagccatctctgtcctccatgagatgatccaacagacc 

SS^ScS^SScAAAGGACTCATCTGCTACTTGGGATGAGA^ 

^?^SIgIcaLttctacactgaactttaccagcagctgaatgacctg 

G™CC?GTATGATGCAGGAGGTTGGAGTGGAAGACACTCCTC^^^ 

^SSactctatcctgactgtgaagaaatacttccgaagaatcact 
ot?I?cSagagaagaaatacagcccttgtgcctgggaggttgtc 
aSgcIScatgagatctttctctttttcaacaaacttgcaaaaa 



SEQ ID NO:78 CH2.3 



AGATTAAGGAGGAAGGAA 



tgtgatctgcctcagacccacai 



.GCCTTGGTAACAGGAGGACTTTGATG 



caaatgggaagaatctctcctttctcctgcctgaaggac 

TTTCCTCAGGAGGAGTTTGATGGCAACCAGTTC 
.TCTCTGTCCTCCATGAGATGATCCAGCAGACC 



ataatggca 
agacatgactttgga- 

cSSIScIaattctacactgaactttaccagcagctgaatca^^^ 



2vr.rrTGTATGATGCAGGAGGTTGGA( 



rinvja&AriACACTCCTCTGATG 
AATGAGGACTCCATCTTGGCTGTGAAGAAATACTTCCGAAGAATCACT
CTCTATCTGACAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTC 
AGAGCAGAAATCATGAGATCTTTCTCTTTCTCAACAAACTTGCAAAAA 
AGATTAAGGAGGAAGGAA 


SEQIDNO:79 


CHl.l 


CDLPQTHSLGNRRALILLAQMGRISPFSCLMDRHDFGFPQEEFDDNQF 
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL 
EACVIQEVGVEETPLMNEDSILAVKKYFRRITLYLTEKKYSPCAWEW 
RAEIMRSFSFSTNLQKRLRRKE 


SEQIDNO:80 


CHI. 2 


CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFDGNQF 
QKAQGISVLHEMIQQTFHLFSTKDSSATWEQSLLEKFSTELNQQLNDL 
EACVIQEVGVEETPLMNVDSILAVKKYFRRITLYLTEKKYSPCAWEW 
RAEIMRSFSFSTNLQKRLRRKE 


SEQ ID NO:81 CH1.3

CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMMQEVGVEDTPLMNVDSILTVRKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:82 CH1.4

CDLPQTHSLGNRRALILLAQMGRISPFSCLKDRHDFGFPQEEFGGNQF
QKAQAISVLHEMIQQTFNLFSTEDSSAAWDETLLDKFYIELFQQLNDL
EACVMQEERVGETPLMNADSILAVKKYFQRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:83 CH2.1

CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMIQEVGVEETPLMNEDSILAVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:84 CH2.2

CDLPQTHSLGNRRALILLAQMGRISPFSCLMDRHDFGFPQEEFDDNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMMQEVGVEETPLMNVDSILTVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE


SEQ ID NO:85 CH2.3

CDLPQTHSLGNRRTLMIMAQMGRISPFSCLKDRHDFGFPQEEFDGNQF
QKAQAISVLHEMIQQTFNLFSTKDSSATWDETLLDKFYTELYQQLNDL
EACMMQEVGVEETPLMNEDSILAVKKYFRRITLYLTEKKYSPCAWEVV
RAEIMRSFSFSTNLQKRLRRKE
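1. An isolated or recombinant nucleic acid, comprising:

a polynucleotide sequence selected from the group consisting of:

(a) SEQ ID NO:1 to SEQ ID NO:35, or a complementary polynucleotide sequence
thereof;

(b) a polynucleotide sequence encoding a polypeptide selected from SEQ ID
NO:36 to SEQ ID NO:70, or a complementary polynucleotide sequence thereof;

(c) a polynucleotide sequence which hybridizes under highly stringent conditions
over substantially the entire length of polynucleotide sequence (a) or (b); and

(d) a polynucleotide sequence comprising a fragment of (a), (b), or (c), which
fragment encodes a polypeptide having antiproliferative activity in a human Daudi cell
line-based assay.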

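2. An isolated or recombinant nucleic acid, comprising:

a polynucleotide sequence selected from the group consisting of:

(a) SEQ ID NO:72 to SEQ ID NO:78, or a complementary polynucleotide
sequence thereof;

(b) a polynucleotide sequence encoding a polypeptide selected from SEQ ID
NO:79 to SEQ ID NO:85, or a complementary polynucleotide sequence thereof;

(c) a polynucleotide sequence which hybridizes under highly stringent conditions
over substantially the entire length of polynucleotide sequence (a) or (b); and

(d) a polynucleotide sequence comprising a fragment of (a), (b), or (c), which
fragment encodes a polypeptide having antiviral activity in a human WISH cell/EMCV-
based assay.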

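3. An isolated or recombinant nucleic acid comprising a polynucleotide sequence
encoding a polypeptide, the polypeptide comprising the amino acid sequence:
CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-R-X24-S-X26-FSCLKDR-X34-DFG-X38-P-
X40-EEFD-X45-X46-X47-FQ-X50-X51-QAI-X55-X56-X57-HE-X60-X61-QQTFN-X67-FSTK-X72-SS-
X75-X76-W-X78-X79-X80-LL-X83-K-X85-X86-T-X88-L-X90-QQLN-X95-LEACV-X101-Q-X103-V-
X105-X106-X107-X108-TPLMN-X114-D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-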
WEVVRAEIMRSFSFSTNLQKRLRRKE, or a conservatively substituted variation
thereof, where X11 is N or D; X12 is R, S, or K; X15 is L or M; X16 is I, M, or V; X19 is A or
G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is H, Y, or Q; X38 is F or L; X40 is Q or R;
X45 is G or S; X46 is N or H; X47 is Q or R; X50 is K or R; X51 is A or T; X55 is S or F; X56
is V or A; X57 is L or F; X60 is M or I; X61 is I or M; X67 is L or F; X72 is D or N; X75 is A
or V; X76 is A or T; X78 is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is E or D; X85 is
F or L; X86 is S or Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M, or V;
X103 is E or G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or G; X114 is V,
E, or G; X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K or R; and
X140 is A or S.
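
The position-by-position residue choices recited above lend themselves to a simple computational check. The following is only an illustrative sketch: the ALLOWED table copies a handful of the recited positions (1-based numbering) rather than all of them, the function name violations is hypothetical, and conservatively substituted variations are not handled.

```python
# Allowed residues at a few of the variable positions recited in the claim
# (1-based numbering). Only a subset is listed; extend as needed.
ALLOWED = {
    11: "ND",
    12: "RSK",
    15: "LM",
    19: "AG",
    34: "HYQ",
    140: "AS",
}


def violations(seq, allowed=ALLOWED):
    """Return [(position, residue)] for positions whose residue is not allowed.

    A position beyond the end of seq is reported with residue None.
    """
    bad = []
    for pos, residues in sorted(allowed.items()):
        if pos > len(seq):
            bad.append((pos, None))  # sequence too short to have this position
        elif seq[pos - 1] not in residues:
            bad.append((pos, seq[pos - 1]))
    return bad
```

An empty list means that every checked position carries one of the recited residues; it does not by itself establish that the fixed positions of the sequence also match.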

4. The nucleic acid of claim 3, said polypeptide having antiproliferative
activity in a human Daudi cell line-based cell proliferation assay or antiviral activity in a
human WISH cell/EMCV-based assay.

5. The nucleic acid of claim 3, wherein the encoded polypeptide has an
antiproliferative activity of at least about 8.3x10^ units/milligram in a human Daudi cell
line-based assay or an antiviral activity of at least about 2.1x10^ units/milligram in a
human WISH cell/EMCV-based assay.

6. The nucleic acid of claim 3, wherein the encoded polypeptide
comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:36
to SEQ ID NO:54.

7. The nucleic acid of claim 3, said nucleic acid comprising a
polynucleotide sequence selected from the group consisting of: SEQ ID NO:1 to SEQ ID
NO:19.

8. An isolated or recombinant nucleic acid comprising a polynucleotide
sequence encoding a polypeptide, the polypeptide comprising:

an amino acid sequence comprising at least 20 contiguous amino acids of any one
of SEQ ID NOS:36-70, and one or more of amino acids Ala19, (Tyr or Gln)34, Gly37,
Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and Glu166, wherein the
numbering of the amino acids corresponds to that of SEQ ID NO:36.

9. The nucleic acid of claim 8, wherein the encoded polypeptide is 166
amino acids in length.
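
The two conditions recited in claim 8 (a minimum run of contiguous amino acids shared with a reference sequence, and the presence of particular residues at numbered positions) can be sketched in code as follows. This is a minimal illustration, not a method taken from the specification: the function names are hypothetical, and markers_present assumes the candidate has already been placed, without gaps, in the numbering of SEQ ID NO:36.

```python
# Residues recited in claim 8, keyed by 1-based position in SEQ ID NO:36 numbering.
MARKERS = {19: "A", 34: "YQ", 37: "G", 38: "F", 71: "K", 76: "A",
           90: "Y", 132: "I", 134: "R", 152: "F", 160: "K", 166: "E"}


def longest_shared_run(candidate, reference):
    """Length of the longest contiguous substring common to both sequences."""
    best = 0
    prev = [0] * (len(reference) + 1)
    for c in candidate:
        cur = [0] * (len(reference) + 1)
        for j, r in enumerate(reference, start=1):
            if c == r:
                cur[j] = prev[j - 1] + 1  # extend the run ending at the previous pair
                best = max(best, cur[j])
        prev = cur
    return best


def markers_present(aligned_seq, markers=MARKERS):
    """Positions at which aligned_seq carries one of the recited residues."""
    return [pos for pos, residues in sorted(markers.items())
            if pos <= len(aligned_seq) and aligned_seq[pos - 1] in residues]
```

Under these assumptions, a candidate would be examined by testing longest_shared_run(candidate, reference) >= 20 against each reference sequence and checking that markers_present returns at least one position.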

10. The nucleic acid of claim 8, wherein the encoded polypeptide has an
antiproliferative activity in a human Daudi cell line-based assay.

11. The nucleic acid of claim 8, wherein the encoded polypeptide has an
antiviral activity in a human WISH cell/EMCV-based assay.

12. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises amino acids Ala19, (Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90,
Ile132, Arg134, Phe152, Lys160, and Glu166.

13. The nucleic acid of claim 8, wherein the encoded polypeptide 
comprises at least 50 contiguous amino acid residues of any one of SEQ ID NOS:36-70. 

14. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises at least 100 contiguous amino acid residues of any one of SEQ ID NOS:36-70.

15. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises at least 150 contiguous amino acid residues of any one of SEQ ID NOS:36-70.

16. The nucleic acid of claim 8, wherein the encoded polypeptide
comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ
ID NO:45, and SEQ ID NO:46.

17. The nucleic acid of claim 8, comprising a polynucleotide sequence
selected from the group consisting of: SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:4, SEQ
ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:10, and SEQ ID NO:11.

18. An isolated or recombinant nucleic acid comprising a polynucleotide 
sequence encoding a polypeptide, the polypeptide comprising: 

an amino acid sequence comprising at least 155 contiguous amino acids of any
one of SEQ ID NOS:36-70, said amino acid sequence comprising amino acids Lys160
and Glu166, wherein the numbering of the amino acids corresponds to that of SEQ ID
NO:36.

19. The nucleic acid of claim 18, wherein the encoded polypeptide
comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ
ID NO:45, and SEQ ID NO:46.

20. A cell comprising the nucleic acid of claim 1, 2, 8, or 18. 

21. The cell of claim 20, wherein the cell expresses a polypeptide encoded 
by the nucleic acid. 

22. A vector comprising the nucleic acid of claim 1, 2, 8, or 18.

23. The vector of claim 22, wherein the vector comprises a plasmid, a
cosmid, a phage, or a virus.

24. The vector of claim 22, wherein the vector is an expression vector. 

25. A cell transduced by the vector of claim 22. 

26. A composition comprising the nucleic acid of claim 1, 2, 8, or 18, and
an excipient.

27. The composition of claim 26, wherein the excipient is a 
pharmaceutically acceptable excipient. 

28. A composition produced by digesting one or more nucleic acids of
claim 1, 2, 3, 8, or 18 with a restriction endonuclease, an RNAse, or a DNAse.

29. A composition produced by a process comprising incubating one or
more nucleic acids of claim 1, 2, 3, 8, or 18 in the presence of deoxyribonucleotide
triphosphates and a nucleic acid polymerase.

30. The composition of claim 29, wherein the nucleic acid polymerase is a
thermostable polymerase.

31. An isolated or recombinant polypeptide encoded by the nucleic acid
of claim 1, 2, 3, 8, or 18.

32. The isolated or recombinant polypeptide of claim 31, comprising a
sequence selected from the group consisting of: SEQ ID NO:36 to SEQ ID NO:70 or SEQ
ID NO:79 to SEQ ID NO:85.

33. The polypeptide of claim 31, having an antiproliferative activity of at
least about 8.3x10^ units/milligram (mg) in a human Daudi cell line-based assay or an
antiviral activity of at least about 2.1x10^ units/milligram in a human WISH cell/EMCV-
based assay.

34. An isolated or recombinant polypeptide, comprising:
the amino acid sequence: CDLPQTHSLG-X11-X12-RA-X15-X16-LL-X19-QM-X22-
R-X24-S-X26-FSCLKDR-X34-DFG-X38-P-X40-EEFD-X45-X46-X47-FQ-X50-X51-QAI-
X55-X56-X57-HE-X60-X61-QQTFN-X67-FSTK-X72-SS-X75-X76-W-X78-X79-X80-LL-X83-K-X85-
X86-T-X88-L-X90-QQLN-X95-LEACV-X101-Q-X103-V-X105-X106-X107-X108-TPLMN-X114-
D-X116-ILAV-X121-KY-X124-QRITLYL-X132-E-X134-KYSPC-X140-
WEVVRAEIMRSFSFSTNLQKRLRRKE, or a conservatively substituted variation
thereof, wherein X11 is N or D; X12 is R, S, or K; X15 is L or M; X16 is I, M, or V; X19 is A
or G; X22 is G or R; X24 is I or T; X26 is P or H; X34 is H, Y, or Q; X38 is F or L; X40 is Q or
R; X45 is G or S; X46 is N or H; X47 is Q or R; X50 is K or R; X51 is A or T; X55 is S or F;
X56 is V or A; X57 is L or F; X60 is M or I; X61 is I or M; X67 is L or F; X72 is D or N; X75
is A or V; X76 is A or T; X78 is E or D; X79 is Q or E; X80 is S, R, T, or N; X83 is E or D;
X85 is F or L; X86 is S or Y; X88 is E or G; X90 is Y, H, or N; X95 is D, E, or N; X101 is I, M,
or V; X103 is E or G; X105 is G or W; X106 is V or M; X107 is E, G, or K; X108 is E or G;
X114 is V, E, or G; X116 is S or P; X121 is K or R; X124 is F or L; X132 is T, I, or M; X134 is K
or R; and X140 is A or S.

35. The polypeptide of claim 34, having antiproliferative activity of at
least about 8.3x10^ units/milligram in a human Daudi cell line-based
assay or antiviral activity of at least about 2.1x10^ units/milligram in a human WISH
cell/EMCV-based assay.
36. The polypeptide of claim 34, comprising a sequence selected from the
group consisting of: SEQ ID NO:36 to SEQ ID NO:54.

37. A polypeptide comprising at least 100 contiguous amino acids of a 
protein encoded by a coding polynucleotide sequence, the polynucleotide sequence 
selected from the group consisting of: 

(a) SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78;

(b) a coding polynucleotide sequence that encodes a first polypeptide selected
from SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85;
and

(c) a complementary polynucleotide sequence which hybridizes under highly
stringent conditions over substantially an entire length of a polynucleotide
sequence of (a) or (b).

38. The polypeptide of claim 37, said polypeptide having an
antiproliferative activity in a human Daudi cell line-based cell proliferation assay or an
antiviral activity in a human WISH cell/EMCV-based assay.

39. The polypeptide of claim 37, wherein the polypeptide specifically 
binds to a human alpha-interferon receptor. 

40. The polypeptide of claim 37, comprising at least 150 contiguous 
amino acids of the encoded protein. 
41. An isolated or recombinant polypeptide, comprising:

an amino acid sequence comprising at least 50 contiguous amino acids of any one of SEQ
ID NOS:36-70, the amino acid sequence comprising one or more of amino acids Ala19,
(Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152, Lys160, and
Glu166, wherein the numbering of the amino acids corresponds to that of SEQ ID NO:36.

42. The polypeptide of claim 41, wherein the polypeptide binds a human
alpha-interferon receptor.

43. The polypeptide of claim 41, said polypeptide exhibiting an
antiproliferative activity in a human Daudi cell line-based cell proliferation assay or an
antiviral activity in a human WISH cell/EMCV-based assay.

44. The polypeptide of claim 41, having an antiproliferative activity of at
least about 8.3x10^ units/milligram in a human Daudi cell line-based assay or an antiviral
activity of at least about 2.1x10^ units/milligram in a human WISH cell/EMCV-based
assay.

45. The polypeptide of claim 41, wherein the polypeptide is 166 amino
acids in length.

46. The polypeptide of claim 41, said polypeptide comprising amino acids
Ala19, (Tyr or Gln)34, Gly37, Phe38, Lys71, Ala76, Tyr90, Ile132, Arg134, Phe152,
Lys160, and Glu166, wherein the numbering of the amino acids of said polypeptide
corresponds to the numbering of amino acids in SEQ ID NO:36.

47. The polypeptide of claim 41, comprising at least 100 contiguous
amino acid residues of any one of SEQ ID NOS:36-70.

48. The polypeptide of claim 41, comprising at least 150 contiguous
amino acid residues of any one of SEQ ID NOS:36-70.

49. The polypeptide of claim 41, comprising at least 155 contiguous
amino acid residues of any one of SEQ ID NOS:36-70.

50. The polypeptide of claim 41, comprising an amino acid sequence
selected from the group consisting of: SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39,
SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:45, and SEQ ID NO:46.

51. An isolated or recombinant polypeptide comprising an amino acid
sequence comprising at least 155 contiguous amino acids of any one of SEQ ID NOS:36-
70, the isolated or recombinant polypeptide comprising amino acids Lys160 and Glu166,
wherein the numbering of the amino acids corresponds to that of SEQ ID NO:36.
52. The polypeptide of claim 51, comprising an amino acid sequence
selected from the group consisting of: SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:39,
SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:45, and SEQ ID NO:46.

53. The polypeptide of claim 51, said polypeptide having an
antiproliferative activity of at least about 8.3x10^ units/milligram in a human
Daudi cell line-based assay or an antiviral activity of at least about 2.1x10^
units/milligram in a human WISH cell/EMCV-based assay.

54. The polypeptide of claim 31, 34, 37, 41, or 51, further comprising a 
secretion/localization sequence. 

55. The polypeptide of claim 31, 34, 37, 41, or 51, further comprising a
polypeptide purification subsequence.

56. The polypeptide of claim 55, wherein the sequence that facilitates
purification is selected from the group consisting of: an epitope tag, a FLAG tag, a
polyhistidine tag, and a GST fusion.

57. The polypeptide of claim 31, 34, 37, 41, or 51, further comprising a
Met at the N-terminus.

58. The polypeptide of claim 31, 34, 37, 41, or 51, comprising a modified
amino acid.

59. The polypeptide of claim 58, wherein the modified amino acid is
selected from the group consisting of: a glycosylated amino acid, a PEGylated amino acid,
a farnesylated amino acid, an acetylated amino acid, and a biotinylated amino acid.

60. A composition comprising the polypeptide of claim 31, 34, 37, 41, or
51 and an excipient.

61. The composition of claim 60, wherein the excipient is a
pharmaceutically acceptable excipient.

62. A composition comprising the polypeptide of claim 58 in a
pharmaceutically acceptable excipient.

63. A polypeptide which is specifically bound by a polyclonal antisera
raised against at least one antigen, said at least one antigen comprising at least one amino
acid sequence of SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85,
or a fragment thereof, wherein the antisera is subtracted with an IFN-alpha polypeptide
encoded by a nucleic acid corresponding to one or more of GenBank accession number:
J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533
(alpha-H), V00542 (alpha-14), V00545 (IFN-IB), X03125 (alpha-8), X02957 (alpha-16),
V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961
(alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787, I07821, M12350 (alpha-F),
M38289, V00549 (alpha-2a), and I08313 (alpha-Con1).

64. An antibody or antisera produced by administering the polypeptide of
claim 31, 34, 37, 41, or 51 to a mammal, which antibody or antisera specifically binds at
least one antigen, said at least one antigen comprising a polypeptide comprising one or
more of the amino acid sequences of SEQ ID NO:36 to SEQ ID NO:70 and SEQ ID
NO:79 to SEQ ID NO:85, or a fragment thereof, which antibody or antisera does not
specifically bind to an IFN-alpha polypeptide encoded by a nucleic acid corresponding to one
or more of GenBank accession number: J00210 (alpha-D), J00207 (Alpha-A), X02958
(Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-IB),
X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532
(alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614,
I01787, I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-
Con1).

65. An antibody or antisera which specifically binds a polypeptide, the
polypeptide comprising a sequence selected from the group consisting of: SEQ ID NO:36
to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85, wherein the antibody or antisera
does not specifically bind to an IFN-alpha polypeptide encoded by a nucleic acid
corresponding to one or more of GenBank accession number: J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-IB), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), I01614, I01787, I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
I08313 (alpha-Con1).

66. A method of producing a polypeptide, the method comprising:

introducing into a population of cells a nucleic acid of claim 1, 2, 3, 8, or 18, the 
nucleic acid operatively linked to a regulatory sequence effective to produce the encoded 
polypeptide; and 

culturing the cells in a culture medium to produce the polypeptide. 

67. A method of producing a polypeptide, the method comprising:

introducing into a population of cells a recombinant expression vector comprising
the nucleic acid of claim 1, 2, 3, 8, or 18; and

culturing the cells in a culture medium under conditions suitable to produce the
polypeptide encoded by the expression vector.

68. A method of inhibiting growth of a population of tumor cells, the
method comprising:

contacting the population of tumor cells with an effective amount of a polypeptide
of claim 31, 34, 37, 41, or 51 sufficient to inhibit growth of tumor cells in said population
of tumor cells, thereby inhibiting growth of tumor cells in said population of cells.

69. The method of claim 68, wherein the tumor cells are selected from the
group consisting of: human carcinoma cells, human leukemia cells, human T-lymphoma
cells, and human melanoma cells.

70. The method of claim 68, wherein the tumor cells are in culture.

71. A method of inhibiting the replication of a virus within at least one
cell infected by the virus, the method comprising:

contacting said at least one infected cell with an effective amount of a polypeptide
of claim 31, 34, 37, 41, or 51 sufficient to inhibit viral replication in said at least one
infected cell, thereby inhibiting replication of the virus in said at least one infected cell.

72. The method of claim 71, wherein the virus is an RNA virus.

73. The method of claim 72, wherein the virus is a human 
immunodeficiency virus or a hepatitis C virus. 

74. The method of claim 71, wherein the virus is a DNA virus.

75. The method of claim 74, wherein the virus is a hepatitis B virus. 

76. The method of claim 71, wherein the cells are cultured. 

77. A method of treating an autoimmune disorder in a patient, the method
comprising: administering to the patient an effective amount of the polypeptide of claim
31, 34, 37, 41, or 51.

78. The method of claim 77, wherein the autoimmune disorder is selected 
from the group consisting of multiple sclerosis, rheumatoid arthritis, lupus erythematosus, 
and type I diabetes. 

79. In a method of treating a disorder treatable by administration of 
interferon-alpha to a subject, an improved method comprising: administering to the subject 
an effective amount of the polypeptide of claim 31, 34, 37, 41, or 51. 

80. The method of claim 79, wherein the disorder treatable by administration
of interferon-alpha is selected from the group consisting of: sclerosis, rheumatoid arthritis,
lupus erythematosus, and type 1 diabetes.

81. A method for making a modified or recombinant nucleic acid, the
method comprising:

recursively recombining a sequence of one or more nucleic acids of claim 1, 2, 3,
8, or 18 with a sequence of one or more additional nucleic acids, each sequence of the one
or more additional nucleic acids encoding an interferon-alpha or an amino acid
subsequence thereof.

82. The method of claim 81, wherein said recursive recombination
produces at least one library of recombinant interferon-alpha homologue nucleic acids.

83. A nucleic acid library produced by the method of claim 82.

84. A population of cells comprising the library of claim 83. 

85. A recombinant interferon-alpha homologue nucleic acid produced by
the method of claim 82.

86. A cell comprising the nucleic acid of claim 85.

87. The method of claim 81, wherein the recursive recombination is 
performed in vitro. 

88. The method of claim 81, wherein the recursive recombination is 
performed in vivo or ex vivo. 

89. A composition comprising two or more nucleic acids of claim 1, 2, 3,
8, or 18.

90. The composition of claim 89, wherein the composition comprises a 
library comprising at least ten nucleic acids. 

91. A method of producing a modified or recombinant interferon-alpha
homologue nucleic acid comprising mutating a nucleic acid of claim 1, 2, 3, 8, or 18.

92. The modified or recombinant interferon-alpha homologue nucleic
acid produced by the method of claim 91.

93. A computer or computer readable medium comprising a database
comprising a sequence record comprising one or more character strings corresponding to a
nucleic acid or protein sequence selected from SEQ ID NO:1 to SEQ ID NO:85.

94. An integrated system comprising a computer or computer readable
medium comprising a database comprising one or more sequence records, each of said
sequence records comprising one or more character strings corresponding to a nucleic acid
or protein sequence selected from SEQ ID NO:1 to SEQ ID NO:85, the integrated system
further comprising a user input interface allowing a user to selectively view said one or
more sequence records.

95. The integrated system of claim 94, the computer or computer readable
medium comprising an alignment instruction set which aligns the character strings with
one or more additional character strings corresponding to a nucleic acid or protein
sequence.

96. The integrated system of claim 95, wherein the instruction set
comprises one or more of: a local homology comparison determination, a homology
alignment determination, a search for similarity determination, and a BLAST
determination.

97. The integrated system of claim 95, further comprising a user readable
output element which displays an alignment produced by the alignment instruction set.
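
As an illustration of what a "local homology comparison determination" might compute, the sketch below scores the best local alignment between two character strings using Smith-Waterman style dynamic programming. It is an assumption-laden example rather than the instruction set of the claimed system: the function name and the match, mismatch, and gap values are hypothetical, and a linear gap penalty is used for brevity.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between strings a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)  # scores for the previous row of the DP matrix
    best = 0
    for ca in a:
        cur = [0]
        for j, cb in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ca == cb else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            cur.append(max(0, diag, prev[j] + gap, cur[j - 1] + gap))
            best = max(best, cur[j])
        prev = cur
    return best
```

Only the score is returned; recovering the aligned strings for display would additionally require storing the full matrix and tracing back from the highest-scoring cell.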

98. The integrated system of claim 94, the computer or computer readable
medium further comprising an instruction set which translates at least one nucleic acid
sequence comprising a sequence selected from SEQ ID NO:1 to SEQ ID NO:35 or SEQ
ID NO:72 to SEQ ID NO:78 into an amino acid sequence.
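
A translating instruction set of the kind recited in claim 98 could be sketched as below. The example assumes the standard genetic code and reading frame 1; the names CODON_TABLE and translate are illustrative and do not come from the specification.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA (or RNA) string in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace("U", "T")
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides shown for SEQ ID NO:1 in the sequence listing.
print(translate("tgtgatctgcctcagacccacagccttggc"))  # prints CDLPQTHSLG
```

The printed peptide agrees with the CDLPQTHSLG N-terminus shared by the polypeptide sequences in this document.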

99. The integrated system of claim 94, the computer or computer readable
medium further comprising an instruction set for reverse-translating at least one amino
acid sequence comprising a sequence selected from SEQ ID NO:36 to SEQ ID NO:70 or
SEQ ID NO:79 to SEQ ID NO:85 into a nucleic acid sequence.

100. The integrated system of claim 99, wherein the instruction set selects
the nucleic acid sequence by applying a codon usage instruction set or an instruction set
which determines sequence identity to a test nucleic acid sequence.
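
Reverse translation guided by codon usage can be illustrated with a one-codon-per-residue lookup. The PREFERRED table below is hypothetical and chosen only for illustration; in practice it would be replaced by a codon usage table for the intended expression host, and a fuller implementation might sample among synonymous codons rather than always choosing one.

```python
# Hypothetical preferred-codon table (one codon per residue), for illustration only.
PREFERRED = {
    "A": "GCC", "R": "CGT", "N": "AAC", "D": "GAT", "C": "TGC",
    "Q": "CAG", "E": "GAA", "G": "GGC", "H": "CAC", "I": "ATC",
    "L": "CTG", "K": "AAA", "M": "ATG", "F": "TTC", "P": "CCG",
    "S": "AGC", "T": "ACC", "W": "TGG", "Y": "TAC", "V": "GTG",
}


def reverse_translate(protein, table=PREFERRED):
    """Return one nucleic acid sequence encoding protein, using table's codons."""
    try:
        return "".join(table[aa] for aa in protein.upper())
    except KeyError as exc:
        raise ValueError(f"no codon for residue {exc.args[0]!r}") from None


print(reverse_translate("CDLPQTHSLG"))  # prints TGCGATCTGCCGCAGACCCACAGCCTGGGC
```

Because the genetic code is degenerate, the output is only one of many nucleic acid sequences that encode the same peptide.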

101. A method of using a computer system to present information
pertaining to at least one of a plurality of sequence records stored in a database, said
sequence records each comprising at least one character string corresponding to SEQ ID
NO:1 to SEQ ID NO:85, the method comprising:

determining a list of at least one character string corresponding to one or more of
SEQ ID NO:1 to SEQ ID NO:85 or a subsequence thereof;

determining which of said at least one character string of said list are selected by a
user; and

displaying each of the selected character strings, or aligning each of the selected
character strings with an additional character string.

102. The method of claim 101, further comprising displaying an alignment 
of each of the selected character strings with the additional character string. 

103. The method of claim 101, further comprising displaying the list.

104. A nucleic acid which comprises a unique subsequence in a nucleic
acid selected from SEQ ID NO:1 to SEQ ID NO:35 or SEQ ID NO:72 to SEQ ID NO:78,
wherein the unique subsequence is unique as compared to a nucleic acid sequence of a
known interferon-alpha nucleic acid sequence or a nucleic acid corresponding to any of
GenBank accession number: J00210 (alpha-D), J00207 (Alpha-A), X02958 (Alpha-6),
X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14), V00545 (IFN-IB), X03125
(alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955 (alpha-4b), V00532 (alpha-C),
X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067 (Gx-1), I01614, I01787,
I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and I08313 (alpha-Con1).

105. A polypeptide which comprises a unique subsequence in a
polypeptide selected from: SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ
ID NO:85, wherein the unique subsequence is unique as compared to a sequence of a
known interferon-alpha polypeptide or a sequence of a polypeptide encoded by a nucleic
acid corresponding to any of GenBank accession number: J00210 (alpha-D), J00207
(Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-IB), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), I01614, I01787, I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
I08313 (alpha-Con1).
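
One simple way to look for candidate "unique subsequences" of the kind recited in claims 104 and 105 is to list the fixed-length windows of a query sequence that occur in none of a set of known sequences. The sketch below assumes exact string matching and a caller-chosen window length k; the function name is hypothetical, and the known sequences (for example, those retrieved for the listed accession numbers) must be supplied by the caller.

```python
def unique_subsequences(query, known, k):
    """Length-k windows of query that occur in none of the known sequences.

    Windows are returned in order of first appearance, without duplicates.
    """
    known_kmers = {s[i:i + k] for s in known for i in range(len(s) - k + 1)}
    seen = set()
    unique = []
    for i in range(len(query) - k + 1):
        kmer = query[i:i + k]
        if kmer not in known_kmers and kmer not in seen:
            seen.add(kmer)
            unique.append(kmer)
    return unique
```

The same function applies to nucleic acid and polypeptide strings alike, since it treats sequences purely as character strings.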

106. A target nucleic acid which hybridizes under stringent conditions to a
unique coding oligonucleotide which encodes a unique subsequence in a polypeptide
selected from: SEQ ID NO:36 to SEQ ID NO:70 or SEQ ID NO:79 to SEQ ID NO:85,
wherein the unique subsequence is unique as compared to a sequence of a known
interferon-alpha polypeptide or a sequence of a polypeptide encoded by a nucleic acid
corresponding to any of GenBank accession number: J00210 (alpha-D), J00207 (Alpha-
A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H), V00542 (alpha-14),
V00545 (IFN-IB), X03125 (alpha-8), X02957 (alpha-16), V00540 (alpha-21), X02955
(alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10 pseudogene), R0067
(Gx-1), I01614, I01787, I07821, M12350 (alpha-F), M38289, V00549 (alpha-2a), and
I08313 (alpha-Con1).

107. The nucleic acid of claim 106, wherein the stringent conditions are
selected such that a perfectly complementary oligonucleotide to the unique coding
oligonucleotide hybridizes to the unique coding oligonucleotide with at least a 5x higher
signal to noise ratio than for hybridization of the perfectly complementary oligonucleotide
to a control nucleic acid corresponding to any of GenBank accession number: J00210
(alpha-D), J00207 (Alpha-A), X02958 (Alpha-6), X02956 (Alpha-5), V00533 (alpha-H),
V00542 (alpha-14), V00545 (IFN-IB), X03125 (alpha-8), X02957 (alpha-16), V00540
(alpha-21), X02955 (alpha-4b), V00532 (alpha-C), X02960 (alpha-7), X02961 (alpha-10
pseudogene), R0067 (Gx-1), I01614, I01787, I07821, M12350 (alpha-F), M38289, V00549
(alpha-2a), and I08313 (alpha-Con1), wherein the target nucleic acid hybridizes to the
unique coding oligonucleotide with at least a 2x higher signal to noise ratio as compared to
hybridization of the control nucleic acid to the coding oligonucleotide.

108. The nucleic acid of any of claims 1, 2, 3, 8, or 18, wherein the nucleic
acid encodes an interferon-alpha homologue having an increased growth inhibition
activity against a population of cancer cells relative to a growth inhibition activity of
human interferon-alpha 2a against said population of cancer cells.

109. The nucleic acid of claim 108, wherein the cancer cells of said
population of cancer cells comprise a cancer cell line selected from: a leukemia cell line, a
melanoma cell line, a lung cancer cell line, a colon cancer cell line, a central nervous
system (CNS) cancer cell line, an ovarian cancer cell line, a breast cancer cell line, a
prostate cancer cell line, and a renal cancer cell line, and the growth inhibition activity is
measured as a concentration of interferon-alpha homologue producing a 50% inhibition of
growth of the cancer cell line (GI50 value), wherein the interferon-alpha homologue has a
GI50 value at least 2-fold lower than the GI50 value of the human interferon-alpha 2a.

110. The nucleic acid of claim 109, wherein the encoded interferon-alpha
homologue has a GI50 value at least 5-fold lower than the GI50 value of the human
interferon-alpha 2a.

111. The nucleic acid of claim 109, wherein the encoded interferon-alpha
homologue has a GI50 value at least 10-fold lower than the GI50 value of the human
interferon-alpha 2a.
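
The "at least N-fold lower" comparisons of GI50 values in claims 109 to 111 reduce to a ratio test, sketched below. The function names are illustrative, both values are assumed to be positive and expressed in the same concentration units, and the numbers in the usage lines are made up for the example rather than taken from any measurement.

```python
def fold_improvement(gi50_reference, gi50_homologue):
    """How many times lower the homologue's GI50 is than the reference's."""
    if gi50_reference <= 0 or gi50_homologue <= 0:
        raise ValueError("GI50 values must be positive")
    return gi50_reference / gi50_homologue


def meets_threshold(gi50_reference, gi50_homologue, fold):
    """True if the homologue's GI50 is at least `fold`-fold lower than the reference's."""
    return fold_improvement(gi50_reference, gi50_homologue) >= fold


# Hypothetical values: reference GI50 of 10.0 and homologue GI50 of 2.0.
print(meets_threshold(10.0, 2.0, 5))   # prints True
print(meets_threshold(10.0, 2.0, 10))  # prints False
```

The same comparison applies unchanged to the TGI and LC50 values recited in the later claims, since each is likewise a concentration for which a lower value indicates higher potency.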

112. The nucleic acid of any of claims 1, 2, 3, 8, or 18, wherein the nucleic
acid encodes an interferon-alpha homologue having an increased cytostatic activity against
a population of cancer cells relative to the cytostatic activity of human interferon-alpha 2a
against said population of cancer cells.

113. The nucleic acid of claim 112, wherein the cancer cells comprise a
cancer cell line selected from: a leukemia cell line, a melanoma cell line, a lung cancer cell
line, a colon cancer cell line, a CNS cancer cell line, an ovarian cancer cell line, a breast
cancer cell line, a prostate cancer cell line, and a renal cancer cell line, the cytostatic
activity measured as the concentration of an interferon-alpha causing a total inhibition of
growth of the cell line (TGI value), wherein the interferon-alpha homologue has a TGI
value at least 2-fold lower than the TGI value of the human interferon-alpha 2a.

114. The nucleic acid of claim 112, wherein the encoded interferon-alpha
homologue has a TGI value at least 5-fold lower than the TGI value of the human
interferon-alpha 2a.

115. The nucleic acid of claim 112, wherein the encoded interferon-alpha 
homologue has a TGI value at least 10-fold lower than the TGI value of the human 
interferon-alpha 2a. 

116. The nucleic acid of any of claims 1, 2, 3, 8, or 18, wherein the nucleic
acid encodes an interferon-alpha homologue having an increased cytotoxic activity against
a population of cancer cells relative to the cytotoxic activity of human interferon-alpha 2a
against said population of cancer cells.

117. The nucleic acid of claim 116, wherein the cancer cells comprise a
cancer cell line selected from: a leukemia cell line, a melanoma cell line, a lung cancer cell
line, a colon cancer cell line, a central nervous system (CNS) cancer cell line, an ovarian
cancer cell line, a breast cancer cell line, a prostate cancer cell line, and a renal cancer cell
line, the cytotoxic activity measured as the concentration of interferon-alpha producing a
50% reduction in an amount of cellular protein in a cell line measured after a period of
incubation (LC50 value), wherein the interferon-alpha homologue has a LC50 value at
least 2-fold lower than the LC50 value of the human interferon-alpha 2a.

118. The nucleic acid of claim 116, wherein the encoded interferon-alpha
homologue has a LC50 value at least 5-fold lower than the LC50 value of the human
interferon-alpha 2a.

119. The nucleic acid of claim 116, wherein the encoded interferon-alpha 
homologue has a LC50 value at least 10-fold lower than the LC50 value of the human 
interferon-alpha 2a. 

120. The polypeptide of any of claims 31, 34, 37, 41, or 51, said
polypeptide having an increased growth inhibition activity against a population of cancer
cells relative to the inhibition activity of human interferon-alpha 2a against the population
of cancer cells.

121. The polypeptide of claim 120, wherein the population of cancer cells
comprises a cancer cell line selected from: a leukemia cell line, a melanoma cell line, a
lung cancer cell line, a colon cancer cell line, a CNS cancer cell line, an ovarian cancer
cell line, a breast cancer cell line, a prostate cancer cell line, and a renal cancer cell line,
the growth inhibition activity measured as the concentration of polypeptide or human
interferon-alpha 2a causing a 50% inhibition of growth of the cell line (GI50 value),
wherein the polypeptide has a GI50 value at least 2-fold lower than the GI50 value of the
human interferon-alpha 2a.

122. A nucleic acid produced by the method of claim 81.

123. An interferon-alpha polypeptide or amino acid subsequence thereof
produced by the method of claim 81.

SEQUENCE LISTING

<110> HEINRICHS, VOLKER
CHEN, TEDDY
PATTEN, PHILLIP A.



<120> IFN-ALPHA HOMOLOGUES 



<130> 02-101510/0140.002



<140> 
<141> 



<150> 09/415,183
<151> 1999-10-07 



<160> aa 



<170> PatentIn Ver. 2.0

<210> 1
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DH12

<400> 1
tgtgatctgc ctcagaccca cagccttggc aacaggaggg ccttgatgct cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagacc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg taggggtgaa agagactccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 2
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2CA3

<400> 2
tgtgatctgc ctcagaccca cagccttggt gacaggaggg ccatgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcagctga atgaactgga agcatgtgtg 300
atacaggagg ttggggtggg agagactccc ctgatgaatg gggactccat cctggctgcg 360
aagaagtact tccaaagaat caccctttat ccaacagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt ttccaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 3 
<211> 498 
<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 






<220> 

<223> Clone ID 4AB9 



<400> 3 

tgtgatctgc ctcagaccca cagccttggc aacaggaggg ccttgatact cctggcacaa 60

atgggacgaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccgg 120 

gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240 

ctcctagaaa aattttccac tgaactttac cagcaactga atgaactgga agcatgtgtg 300 

atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360

aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgctcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 

<210> 4

<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic DNA 
<220> 

<223> Clone ID 2DA4 


<400> 4 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120 
gaggagtttg atagcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atgcagcaga ccttcaatct cttcagcaca aaggactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300 
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccat cctggctgtg 360 
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420 
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498

<210> 5
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 3DA11

<400> 5
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccctggtact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagat atgatttcgg atccccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca cctctgccct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgctcg ggatgagacc 240
ctcctagaaa aatctcccac tgaacttcac cagcagccga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
cgggaggtcg tcagagcaga aatcatgaga tctttctcct tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 6
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DB11

<400> 6
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcagctga atgacttgga agcctgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 7
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2CA5

<400> 7
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaaccg gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 8 






<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<220> 

<223> Clone ID 2G6 



<400> 8 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 

atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120 

gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240

ctcctagaaa aattctccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300 

atacaggagg ttggggtgga agagactccc ctgatgaatg tggaccccat cctggctgtg 360 

aagaaatact tccaaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498

<210> 9 
<211> 498 
<212> DNA 
<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<220>

<223> Clone ID 3AH7 



<400> 9 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 

atgcgaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120

gaggagtttg atagcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240 

ctcctagaaa aattttccac tgaacttcac cagcaactga atgaactgga agcatgtgta 300 

gtacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360 

aagaaatacc tccaaagaat cactctttat ctgacagaga agaagtatag cccttgtgca 420

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498

<210> 10 
<211> 498
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA
<220> 

<223> Clone ID 2G5 

<400> 10

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactctac cagcagccga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccac cctggctgtg 360
aggaagtacc tccaaagaac cactccttat ccaatagaga ggaaatacag ccctcgtgca 420
tgggaggttg tcagagcaga aatcatgaga tccttctctt cttcaacaaa cctgcaaaaa 480
agattaagga ggaaggaa 498

<210> 11
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2BA8

<400> 11
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccctgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctaatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 12
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 1F3

<400> 12
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctgggacaa 60
atgggaagaa tctctcattt ctcctgcctg aaggacagac atgactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaacct cttcagcaca aaggactcat ctgttgcttg ggatgagagg 240
cttctagaca aactctatac tgaactttac cagcagctga atgacctgga agcctgtgtg 300
atgcaggagg tgtgggtggg agggactccc ctgatgaatg aggactccat cctggctgtg 360
agaaaatact tccaaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 13 

<211> 498
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<220> 



<223> Clone ID 4BE10



<400> 13 

tgcgatctgc cccagaccca cagccttggt aacaggaggg ccttgatact cctggcacag 60 

atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120

gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtccc ccatgagata 180 

atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240 

ctcctagaaa aacttcccac tgaaccttac cagcaactga atgaactgga agcatgtgcg 300 

atacaggggg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360 

aggaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgttcc 420

tgggaggttg tcagagcaga aatcatgaga cccccctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498 



<210> 14
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DD9

<400> 14
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatgct cctggcacaa 60
atgggaagaa tctccccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tggactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 15
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 3CA1

<400> 15
tgtgatctgc ctcagaccca cagccttggc aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attaccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcatgtgtg 300
atacaggagg ttgggatgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 16
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2F8

<400> 16
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgccct ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaaa aattttccac tgaactttac cagcaactga atgaactgga agcatgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaagtatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 17 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<220> 

<223> Clone ID 6CG3
<400> 17 

tgtgatctgc ctcagaccca cagccttggt aacaagaggg ccatgatgct cctggcacaa 60 
atgggaagaa cctctccttt ctcctgtctg aaggacagac atgactttgg attcccccag 120 

gaggagtttg atggcaacca gttccagagg gctcaagcca tctttgtcct ccatgagatg 180
atccagcaga ccttcaattt cttcagcaca aaggactcat ctgctgcttg ggaacagagc 240 
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300 
atacaggaag ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360 
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 18
<211> 498 
<212> DNA

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<220> 

<223> Clone ID 3CG7 



<400> 18 

tgtgatctgc ctcagaccca cagccttggt aacagtaggg ccttgatgct cctggcacaa 60
atgggaagaa tctccccttt ctcctgcctg aaggacagac atgatttcgg attcccccag 120 
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgcctt ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagaac 240 
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcatgtgtg 300 



WO 01/25438 PCT/US00/27781



atacaggagg ttgggatgga agagactccc ctgatgaatg tggactccat cctggctgtg 360 

aggaagtacc tccaaagaat cactctttat ctaacagaga ggaaatacag cccccgtgcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa ctcgcaaaaa 480 

agattaagga ggaaggaa 498 

<210> 19 
<211> 498 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 



<220> 

<223> Clone ID 1D3
<400> 19 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 

atgggaagaa tctctcattt ctcctgcctg aaggacagac atgatttcgg attcccccag 120 

gaggagtctg atggccacca gttccagaag actcaagcca tctctgtcct ccatgagatg 180

atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctgcttg ggaacagagc 240 

ctcctagaaa aattttccac tgaactttac cagcaactga atgacctgga agcatgtgtg 300

atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360 

aagaaatact tccaaagaat cactctttat ctgatggaga agaaatacag cccttgtgcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498 

<210> 20 

<211> 498 

<212> DNA

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<220> 

<223> Clone ID 2G4 



<400> 20 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccatgatgct cctggcacaa 60

atgagcagaa tctctccttc ctcctgtctg atggacagac atgactttga atttccccag 120 

gaggaatttg atgataaaca gttccagaag gctccagcca tctctgtcct ccatgaggtg 180 

attcagcaga ccttcaatct cttcagcaca gaggactcat ctgctgcttg ggaacagacc 240 

ctcctagaaa aattttccac tgaactttac cagcaactga atgacctgga agcatgtgtg 300 

atgcaggagg agagggtggg agaaactccc ctgatgaatg cggactccat cttggctgtg 360

aggaaatact tccaaagaat cactctttat ctgacaaaga agaagtatag cccttgttcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 

<210> 21
<211> 498 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic DNA 

<220> 

<223> Clone ID 1A1

<400> 21

tgtgatctgc cccagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60

atgggaagaa tctctcattt ctcctgcctg aaggacagat atgatttcgg attcccccag 120 

gaggtgtttg atggcaacca gttccagaag gcccaagcca tctctgccct ccatgagatg 180

atgcagcaga ccttcaatct cttcagcaca gaggactcat ctgctgcctg ggaacagagc 240 

ctcctagaaa aattttccac cgaacttcac cagcaactga atgacctgga agcctgtgtg 300 

atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360 

aggaaatact ctcaaagaat cactctttat ctaatggaga agaaatacag cccttgcgcc 420 

tgggaggctg tcagagcaga aatcatgaga tctccctctt ttccaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498 

<210> 22 
<211> 498 
<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA



<220> 

<223> Clone ID 1D10



<400> 22 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60

atgggaagaa tctctcattt ctcctgcctg aaggacagac atgatttcgg attcccccag 120 

gaggagtttg atggccacca gttccagaag actcaagcca tctctgtcct ccatgagatg 180 

atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctgcttg ggaacagagc 240 

ctcctagaaa aattttccac tgaactttac cagcaactga atgacctgga agcatgtgtg 300 

atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cctggctgtg 360

aagaaatact tccaaagaat cactctttat ctgatggaga agaaatacag cccttgtgcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498 

<210> 23
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic DNA 



<220>

<223> Clone ID 1F6 



<400> 23 

tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60 

atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttccccag 120 

gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240

ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300 

atacaggagg ctggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360

aagaaatact tccaaagaat cactctttat ctaacagaga agaaatacag cccttgtgcc 420 

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498

<210> 24
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2A10

<400> 24
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctcattt ctcccgcccg aaggacagat atgatttcgg attcccccag 120
gaggtgcttg atggcaacca gttccagaag gctcaagcca tccctgcctt ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacctg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcaaccga ataacctgga agcatgtgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cccggctgtg 360
aggaaatact ttcaaagaat cactctttat ctgatggaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 25
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2C3

<400> 25
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120
gaggagtttg atggcaacca gtcccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgatacttg ggatgcgacc 240
cttttagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498



<210> 26
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2D1

<400> 26
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagac aagactttgg attcccccag 120
gaggagtttg atggcaaccg gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactctac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg aggactccat cctggctgtg 360
aagaaatact cccaaagaat caccctttac ctaatagaga ggaaatacag cccttgtgca 420
tgggaggttg tcagagcaga aatcatgaga tccttctctt tttcaacaaa cctgcaaaaa 480
agatcaagga ggaaggaa 498

<210> 27
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2D10

<400> 27
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagag tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgcctt ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 28
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2D7

<400> 28
tgtgatctgc ctcagaccca cagccttggt aacaggcggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgtctg aaggacagac atgacttcag atttccccag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaactttac cagcaactga ataacctgga agcttgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactctat cctggctgtg 360
aagaaatact tccaaagaat cactctttat ctgacagaga ggaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 29 
<211> 498
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic DNA

<220>

<223> Clone ID 2D9 

<400> 29

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactctgg attcccccag 120 
gaggagcttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccacgagatg 180 
atccagcaga ctttcaatcc cttcagcaca aaggactcac ctgctacttg ggaacagagc 240
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300 
atacaggagg ttggggtgga agagactccc ctggtgaatg tggaccccac cctggctgtg 360 
aagaaatact tccaaagaat cactcctcat ccgacagaga agaaatacag cccttgtgcc 420 
tgggaggctg tcagagcaga aatcatgaga tctctctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498

<210> 30

<211> 498 

<212> DNA 

<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic DNA 

<220>

<223> Clone ID 2DA2 

<400> 30

tgtgatctgc ctcagaccca cagccttggt aacaggaggc ccttgatact cctggcacaa 60 

atgggaagaa tctctccttt ctcctgcctg aaggacagac aggacttcgg attcccccag 120

gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 

atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggaacagagc 240 

ctcctagaaa aattttccac tgaaccccac cagcaactga atgaactgga agcatgtgtg 300

atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360 

aagaaatact tccaaagaat cactctttat ctaatagaga ggaaatacag cccttgtgca 420

tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 

agattaagga ggaaggaa 498 



<210> 31
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2DH9

<400> 31
tgtgatctgc ctcagaccca cagccctggt aacaggaggg ccttgatgct cctggcacaa 60
atgggacgaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120
ggggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240
ctcctagaaa aattttccac tgaactctac cggcagctga atgacctgga agcctgtgtg 300
atacaggagg ttggggtgga agagaccccc ctgatgaatg tggactccat cctggctgtg 360
aggaagtact tccaaagaat cactctttat ctgacagaga agaagcatag cccttgttcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 32
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2G11

<400> 32
tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg acttccccag 120
gaggagtttg atggcaacca gttccagaag actcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgatacttg ggaacagagc 240
ctcctagaaa aattctacat tgaacttttc cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360
agaaaatact tccaaagaat cactctttat ctgacagagg agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 33
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID 2G12

<400> 33
tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgct catggcacaa 60
atgaggagaa tctctccttt cccccgcctg aaggacagat atgatttcgg attcccccag 120
gaggtgtttg atggcaacca gttccagaag gctcaagcta tcttcctttt ccatgagatg 180
atgcagcaga ccttcaatct cttcagcaca aagaactcat ctgctgcttg ggatgagacc 240
ctcctagaca aattctacac tgaactctac cagcagctga atgacttgga agcctgtgtg 300
atgcaggagg ggagggtggg agaaactccc ctgatgaatg cggactccat cttggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggctg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 34 
<211> 498 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA 
<220> 

<223> Clone ID 2H9
<400> 34 

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120 
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180



atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggaacagagc 240 
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctaga agcctgtgtg 300 
acacaggagg ttggggtgga agagactccc ctgatgaatg aggactctat cctggctgtg 360 
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga tctctctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498 

<210> 35 
<211> 498
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic DNA
<220> 

<223> Clone ID 6BC11 

<400> 35

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagat atgatttcgg attcccccag 120 
gaggagtttg atggcaacca gctccagaag gctcaagcca tctctgtcct ccatgagatg 180 
atccagcaga ccttcaatct cttcagcaca aaggattcat ctgctgcttg ggaacagagc 240 
ctcctagaaa aattttccac tgaacttaac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggagtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360 
aagaaatact tccaaagaat cactctttat ctgacagaga ggaaatacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 


<210> 36
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DH12

<400> 36
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Lys Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 37
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2CA3

<400> 37
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asp Arg Arg Ala Met Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Gly Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 38
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 4AB9

<400> 38
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Arg Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 39
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DA4

<400> 39
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Ser Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 40
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3DA11

<400> 40
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Val
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 41
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DB11

<400> 41
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 42
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2CA5

<400> 42
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Arg Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 43
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G6

<400> 43
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Pro Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 44
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3AH7

<400> 44
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Arg Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Ser Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu His Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Val Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Leu Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 45
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G5

<400> 45
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 46
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2BA8

<400> 46
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

<210> 47
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1F3

<400> 47
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Gly Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Val Ala Trp Asp Glu Arg
65 70 75 80

Leu Leu Asp Lys Leu Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Val Trp Val Gly Gly Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165






<210> 48
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 4BE10

<400> 48
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Ile Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Gly Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 49
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DD9

<400> 49
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Gly Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165


<210> 50
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3CA1

<400> 50
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Leu Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Met Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 51
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2F8

<400> 51
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 52
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 6CG3

<400> 52
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Lys Arg Ala Met Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Thr Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Arg Ala Gln Ala Ile Phe Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Phe Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 53
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 3CG7

<400> 53
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Ser Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Asn
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Met Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 54
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1D3

<400> 54
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly His Gln Phe
35 40 45

Gln Lys Thr Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 55
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G4

<400> 55
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Met Met
1 5 10 15

Leu Leu Ala Gln Met Ser Arg Ile Ser Pro Ser Ser Cys Leu Met Asp
20 25 30

Arg His Asp Phe Glu Phe Pro Gln Glu Glu Phe Asp Asp Lys Gln Phe
35 40 45

Gln Lys Ala Pro Ala Ile Ser Val Leu His Glu Val Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Glu Gln Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Glu Arg Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Ala Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Lys Lys Lys Tyr Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

<210> 56
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1A1

<400> 56
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Val Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu His Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 57
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1D10

<400> 57
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Arg Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Leu
35 40 45

Gln Lys Thr Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Gly Val Gly Val Glu Glu Thr Pro Pro Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 58
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 1F6

<400> 58
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Ala Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 59
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2A10

<400> 59
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser His Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Val Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Met Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 60
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2C3

<400> 60
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Ser
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Asp Thr Trp Asp Ala Thr
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 61
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D1

<400> 61
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Arg Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Met Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165


<210> 62
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D10

<400> 62
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Val Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Ala Phe His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 63
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D7

<400> 63
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Arg Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Gln Gln Leu Asn Asn Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165


<210> 64
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2D9

<400> 64
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Val
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165

<210> 65
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DA2

<400> 65
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Pro Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Gln Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu His Gln Gln Leu Asn Glu Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Ile Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 66
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2DH9

<400> 66
Cys Asp Leu Pro Gln Thr His Ser Pro Gly Asn Arg Arg Ala Leu Met
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Gly Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Tyr Arg Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys His Ser Pro Cys Ser Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165






<210> 67
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G11

<400> 67
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Leu Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Thr Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Asp Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Tyr Ile Glu Leu Phe Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Glu Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 68
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2G12

<400> 68

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Leu Met Ala Gln Met Arg Arg Ile Ser Pro Phe Pro Arg Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Val Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Phe Leu Phe His Glu Met Met Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asn Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Gly Arg Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Ala Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Ala Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 69
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 2H9

<400> 69

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Thr Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 70
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID 6BC11

<400> 70

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg Tyr Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Leu
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Arg Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 71
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID tl9bb

<220>
<221> MOD_RES
<222> (11)
<223> N or D

<220>
<221> MOD_RES
<222> (12)
<223> R, S, or

<220>
<221> MOD_RES
<222> (15)
<223> L or M

<220>
<221> MOD_RES
<222> (16)
<223> I, M or V

<220>
<221> MOD_RES
<222> (19)
<223> A or G

<220>
<221> MOD_RES
<222> (22)
<223> G or R

<220>
<221> MOD_RES
<222> (24)
<223> I or T

<220>
<221> MOD_RES
<222> (26)
<223> P or H

<220>
<221> MOD_RES
<222> (34)
<223> H, Y or Q

<220>
<221> MOD_RES
<222> (38)
<223> F or L

<220>
<221> MOD_RES
<222> (40)
<223> Q or R

<220>
<221> MOD_RES
<222> (45)
<223> G or S

<220>
<221> MOD_RES
<222> (46)

<223> N orv» 

<220>
<221> MOD_RES
<222> (47)
<223> Q or R






<220>
<221> MOD_RES
<222> (50)
<223> K or R

<220>
<221> MOD_RES
<222> (51)
<223> A or T

<220>
<221> MOD_RES
<222> (55)
<223> S or F

<220>
<221> MOD_RES
<222> (56)
<223> V or A

<220>
<221> MOD_RES
<222> (57)
<223> L or F

<220>
<221> MOD_RES
<222> (60)
<223> M or I

<220>
<221> MOD_RES
<222> (61)
<223> I or M

<220>
<221> MOD_RES
<222> (67)
<223> L or F

<220>
<221> MOD_RES
<222> (72)
<223> D or N

<220>
<221> MOD_RES
<222> (75)
<223> A or V

<220>
<221> MOD_RES
<222> (76)
<223> A or T

<220>
<221> MOD_RES
<222> (78)
<223> E or D

<220>
<221> MOD_RES
<222> (79)
<223> Q or E

<220>
<221> MOD_RES
<222> (80)
<223> S, R, T, or N

<220>
<221> MOD_RES
<222> (83)
<223> E or D

<220>
<221> MOD_RES
<222> (85)
<223> F or L

<220>
<221> MOD_RES
<222> (86)
<223> S or Y

<220>
<221> MOD_RES
<222> (88)
<223> E or G

<220>
<221> MOD_RES
<222> (90)
<223> Y, H, or N

<220>
<221> MOD_RES
<222> (95)
<223> D, E, or N

<220>
<221> MOD_RES
<222> (101)
<223> I, M, or V

<220>
<221> MOD_RES
<222> (103)
<223> E or G

<220>
<221> MOD_RES
<222> (105)
<223> G or W

<220>
<221> MOD_RES
<222> (106)
<223> V or M

<220>
<221> MOD_RES
<222> (107)
<223> E, G, or K

<220>
<221> MOD_RES
<222> (108)
<223> E or G

<220>
<221> MOD_RES
<222> (114)
<223> V, E, or G

<220>
<221> MOD_RES
<222> (116)
<223> S or P

<220>
<221> MOD_RES
<222> (121)
<223> K or R

<220>
<221> MOD_RES
<222> (124)
<223> F or L

<220>
<221> MOD_RES
<222> (132)
<223> T, I, or M

<220>
<221> MOD_RES
<222> (134)
<223> K or R

<220>
<221> MOD_RES
<222> (140)
<223> A or S

<400> 71

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Xaa Xaa Arg Ala Xaa Xaa
1 5 10 15

Leu Leu Xaa Gln Met Xaa Arg Xaa Ser Xaa Phe Ser Cys Leu Lys Asp
20 25 30

Arg Xaa Asp Phe Gly Xaa Pro Xaa Glu Glu Phe Asp Xaa Xaa Xaa Phe
35 40 45

Gln Xaa Xaa Gln Ala Ile Xaa Xaa Xaa His Glu Xaa Xaa Gln Gln Thr
50 55 60

Phe Asn Xaa Phe Ser Thr Lys Xaa Ser Ser Xaa Xaa Trp Xaa Xaa Xaa
65 70 75 80

Leu Leu Xaa Lys Xaa Xaa Thr Xaa Leu Xaa Gln Gln Leu Asn Xaa Leu
85 90 95

Glu Ala Cys Val Xaa Gln Xaa Val Xaa Xaa Xaa Xaa Thr Pro Leu Met
100 105 110

Asn Xaa Asp Xaa Ile Leu Ala Val Xaa Lys Tyr Xaa Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Xaa Glu Xaa Lys Tyr Ser Pro Cys Xaa Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 72
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.1

<400> 72

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60
atgggaagaa tctctccttt ctcctgtctg atggacagac atgactttgg atttccccag 120 
gaggagtttg atgacaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 
atccaacaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240 
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgcgtg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cttggctgtg 360 
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498

<210> 73
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.2

<400> 73

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120 
gaggagtttg atggcaacca gttccagaag gcccaaggca tctccgtcct ccatgagatg 180 
atccagcaga ccttccatct cttcagcaca aaggacccat ctgctacttg ggaacagagc 240 
cccccagaaa aatcttccac cgaacttaac cagcagctga acgacctgga agcctgcgtg 300 
atacaggagg ctggggtgga agagactccc ctgatgaatg tggactccat cctggctgtg 360 
aagaaatact tccgaagaat cactctttat ctgacagaga agaaacacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga cctttctctt ccccaacaaa cccgcaaaaa 480 
agattaagga ggaaggaa 498 

<210> 74
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.3

<400> 74

tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120 
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180 
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240 
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300 
atgcaggagg ttggagtgga agacactcct ctgatgaatg tggactctat cctgactgtg 360 
agaaaatact ttcgaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 

<210> 75
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH1.4

<400> 75

tgtgatctgc ctcagaccca cagcctgggt aataggaggg ccttgatact cctggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg attcccccag 120 
gaggagtttg gtggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca gaggactcat ctgctgcttg ggatgagacc 240 
ctcctagaca aattctacat tgaacttttc cagcaactga atgacctgga agcctgtgtg 300 
atgcaggagg agagggtggg agaaactccc ctgatgaatg cggactccat cttggctgtg 360 
aagaaatact tccaaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 

<210> 76
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH2.1

<400> 76

tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60 
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120 
gaggagtttg atggcaacca gtcccagaag gctcaagcca tctctgtcct ccatgagatg 180 
atccagcaga ccttcaatct cttcagcaca aaggactcac ctgctacttg ggatgagaca 240 
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300
atacaggagg ttggggtgga agagactccc ctgatgaatg aggactccat cttggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420 
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480 
agattaagga ggaaggaa 498 

<210> 77
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH2.2

<400> 77

tgtgatctgc ctcagaccca cagccttggt aacaggaggg ccttgatact cctggcacaa 60 
atgggaagaa tctctccttt ctcctgtctg atggacagac atgactttgg atttccccag 120
gaggagtttg atgacaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccaacaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240 
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300 
atgcaggagg ttggagtgga agacactcct ctgatgaatg tggactctat cctgactgtg 360 
aagaaatact tccgaagaat cactctttat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctttctctt tttcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498 

<210> 78
<211> 498
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<220>
<223> Clone ID CH2.3

<400> 78

tgtgatctgc ctcagaccca cagccttggt aacaggagga ctttgatgat aatggcacaa 60
atgggaagaa tctctccttt ctcctgcctg aaggacagac atgactttgg atttcctcag 120
gaggagtttg atggcaacca gttccagaag gctcaagcca tctctgtcct ccatgagatg 180
atccagcaga ccttcaatct cttcagcaca aaggactcat ctgctacttg ggatgagaca 240
cttctagaca aattctacac tgaactttac cagcagctga atgacctgga agcctgtatg 300
atgcaggagg ttggagtgga agacactcct ctgatgaatg aggactccat cttggctgtg 360
aagaaatact tccgaagaat cactctctat ctgacagaga agaaatacag cccttgtgcc 420
tgggaggttg tcagagcaga aatcatgaga tctctctctt tctcaacaaa cttgcaaaaa 480
agattaagga ggaaggaa 498

<210> 79
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.1

<400> 79

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Met Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Asp Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 80
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.2

<400> 80

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Gly Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe His Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Glu Gln Ser
65 70 75 80

Leu Leu Glu Lys Phe Ser Thr Glu Leu Asn Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 81
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.3

<400> 81

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Met Gln Glu Val Gly Val Glu Asp Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Thr Val Arg Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 82
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH1.4

<400> 82

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Gly Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Glu Asp Ser Ser Ala Ala Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Ile Glu Leu Phe Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Val Met Gln Glu Glu Arg Val Gly Glu Thr Pro Leu Met
100 105 110

Asn Ala Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Gln Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 83
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH2.1

<400> 83

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Ile Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 84
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH2.2

<400> 84

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Ala Leu Ile
1 5 10 15

Leu Leu Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Met Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Asp Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Met Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Val Asp Ser Ile Leu Thr Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 85
<211> 166
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<220>
<223> Clone ID CH2.3

<400> 85

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Asn Arg Arg Thr Leu Met
1 5 10 15

Ile Met Ala Gln Met Gly Arg Ile Ser Pro Phe Ser Cys Leu Lys Asp
20 25 30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Asp Gly Asn Gln Phe
35 40 45

Gln Lys Ala Gln Ala Ile Ser Val Leu His Glu Met Ile Gln Gln Thr
50 55 60

Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Thr Trp Asp Glu Thr
65 70 75 80

Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu
85 90 95

Glu Ala Cys Met Met Gln Glu Val Gly Val Glu Glu Thr Pro Leu Met
100 105 110

Asn Glu Asp Ser Ile Leu Ala Val Lys Lys Tyr Phe Arg Arg Ile Thr
115 120 125

Leu Tyr Leu Thr Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val
130 135 140

Arg Ala Glu Ile Met Arg Ser Phe Ser Phe Ser Thr Asn Leu Gln Lys
145 150 155 160

Arg Leu Arg Arg Lys Glu
165



<210> 86
<211> 15
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic DNA

<400> 86

tgcgacttac cacaa 15



<210> 87
<211> 26
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic amino acid

<400> 87

Trp Glu Val Val Arg Ser Glu Ile Met Arg Ser Phe Ser Tyr Ser Thr
1 5 10 15

Asn Leu Gln Arg Arg Leu Arg Arg Lys Asp
20 25



WO 01/25438 PCT/US00/27781



<210> 88 

<211> 26 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic amino acid 
<400> 88 

Trp Glu Leu Val Arg Ala Glu Ile Val Arg Ser Phe Ser Phe Ser Thr
1 5 10 15 



Asn Leu Asn Lys Arg Leu Arg Lys Lys Glu
20 25 
This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. [X] Claims Nos.:
because they relate to subject matter not required to be searched by this Authority, namely:

Although claims 77-80 and claims 81-82, 88 (as far as they concern an in vivo
method) are directed to a method of treatment of the human/animal body (Rule
39.1(iv) PCT), the search has been carried out and based on the alleged
effects of the compound/composition.

2. [X] Claims Nos.: 68-69, 71-75, 93-103 all partially
because they relate to parts of the International Application that do not comply with the prescribed requirements to such
an extent that no meaningful International Search can be carried out, specifically:

see FURTHER INFORMATION sheet PCT/ISA/210

3. [ ] Claims Nos.:
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).
This International Searching Authority found multiple inventions in this international application, as follows:

see additional sheet

1. [ ] As all required additional search fees were timely paid by the applicant, this International Search Report covers all
searchable claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this International Search Report
covers only those claims for which fees were paid, specifically claims Nos.:

4. [X] No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:

Claims (1, 3, 4-123) all partially

Remark on Protest
[ ] The additional search fees were accompanied by the applicant's protest.
[ ] No protest accompanied the payment of additional search fees.
Claims Nos.: 68-69, 71-75, 93-103 all partially

Claims 68-69 have been searched in view of claim 70.

Claims 71-75 have been searched in view of claim 76.

Although claims 93-103 could at least partially be considered a
mere presentation of information (Rule 39.1(v) PCT), the search has been
carried out as far as possible in our systematic documentation.

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
This International Searching Authority found multiple (groups of)
inventions in this international application, as follows: 

Invention 1: Claims (1, 3, 4-123) all partially

Nucleic acid represented by sequence ID no. 1 encoding a
polypeptide represented by sequence ID no. 36. Antibody.
Method of production. Therapeutic uses thereof.

Inventions 2 to 35: Claims (1, 3, 4-123) all partially as far as applicable.

Nucleic acid represented by sequences ID no. 2 to no. 35
encoding a polypeptide represented respectively by sequences
ID no. 37 to no. 70. Antibody. Method of production.
Therapeutic uses thereof.

Invention 36: Claims (2, 20-33, 37-40, 54-123) all partially

Nucleic acid represented by sequence ID no. 72 encoding a
polypeptide represented by sequence ID no. 79. Antibody.
Method of production. Therapeutic uses thereof.

Inventions 37 to 42: Claims (2, 20-33, 37-40, 54-123) all partially as far as applicable.

Nucleic acid represented by sequences ID no. 73 to no. 78
encoding a polypeptide represented respectively by sequences
ID no. 80 to no. 85. Antibody. Method of production.
Therapeutic uses thereof.
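
The pairing of nucleic acid and polypeptide sequence identifiers in the invention groups above follows two fixed offsets: nos. 1 to 35 pair with nos. 36 to 70, and nos. 72 to 78 pair with nos. 79 to 85. The following is a minimal illustrative sketch of that bookkeeping only. The function names `polypeptide_seq_id` and `invention_number` are hypothetical and do not appear in the application or the search report.

```python
def polypeptide_seq_id(nucleic_seq_id: int) -> int:
    """Return the polypeptide SEQ ID paired with a nucleic acid SEQ ID.

    Assumption: the pairing is exactly the one stated in the invention
    groups (1-35 -> 36-70 and 72-78 -> 79-85); other IDs are rejected.
    """
    if 1 <= nucleic_seq_id <= 35:
        return nucleic_seq_id + 35
    if 72 <= nucleic_seq_id <= 78:
        return nucleic_seq_id + 7
    raise ValueError(f"SEQ ID {nucleic_seq_id} is not a listed nucleic acid")


def invention_number(nucleic_seq_id: int) -> int:
    """Return the invention group (1-42) that a nucleic acid SEQ ID falls in."""
    if 1 <= nucleic_seq_id <= 35:
        return nucleic_seq_id          # inventions 1 to 35
    if 72 <= nucleic_seq_id <= 78:
        return nucleic_seq_id - 36     # inventions 36 to 42
    raise ValueError(f"SEQ ID {nucleic_seq_id} is not a listed nucleic acid")
```

For example, under these assumptions `polypeptide_seq_id(72)` gives 79 and `invention_number(72)` gives 36, matching Invention 36 as listed.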